Vol. XXXII. Part II October, 1941 


BIOMETRIKA 


A JOURNAL FOR THE STATISTICAL STUDY OF 
BIOLOGICAL PROBLEMS 


FOUNDED BY 
W. F. R. WELDON, FRANCIS GALTON AND KARL PEARSON


EDITED BY 
EGON S. PEARSON 


IN CONSULTATION WITH 
HARALD CRAMER J. B. S. HALDANE
R. C. GEARY G. M. MORANT 
MAJOR GREENWOOD JOHN WISHART 


Reprinted by offset-litho, 1952, 1960 


ISSUED BY THE BIOMETRIKA OFFICE 
UNIVERSITY COLLEGE, LONDON 


AND PRINTED AT THE 
UNIVERSITY PRESS, CAMBRIDGE 


PRINTED IN GREAT BRITAIN 


[Issued 31 October 1941]
















VOLUME XXXII. PART II OCTOBER, 1941





THE LAWS OF CHANCE, IN RELATION TO THOUGHT 
AND CONDUCT 


INTRODUCTORY, DEFINITIONS AND FUNDAMENTAL 
CONCEPTIONS 


BEING THE FIRST OF A SERIES OF LECTURES DELIVERED BY 
KARL PEARSON AT GRESHAM COLLEGE IN 1892 


[It is just fifty years since Karl Pearson took up the part-time appointment of Lecturer in 
Geometry at Gresham College in the City of London. This appointment, which he held 
during the years 1891–4, involved the delivery of certain courses of public evening lectures.
His first course on ‘The scope and concepts of modern science’, commenced on 3 March 
1891; much of its material was afterwards published as The Grammar of Science. Later 
series of lectures dealt with ‘The geometry of statistics’ and ‘The laws of chance’. The 
lecture printed below was found among Pearson’s papers; it was delivered on 1 November 
1892 and was the first of a series devoted to the theory of probability. ED.]


In everyday life we feel, and justifiably feel, irritated with the man who is 
perpetually asking us to define the words we use. We are wont to reply that we 
use our terms in the ‘ordinary’ or ‘customary’ sense. As a general rule mankind 
understand each other in ordinary intercourse and do not stop to discuss the 
meaning of words. But in important and delicate business or in legal contracts 
the accurate definition of the words employed becomes of the utmost weight. 
Even more urgent still is clear definition in the matter of scientific investigation. 
It will not do here to appeal to that vague or floating sense of a word, which is 
termed the ‘ordinary’ or ‘customary’ one, for hardly any two persons use the 
same abstract word for precisely the same range of ideas. What is still more 
remarkable is the change which the meanings of words undergo in a few generations,
so that even the language of our grandfathers requires to be read in the
light of their (and not our) customary use of words. Take words apparently so
simple as Nature, Right, Belief, Law, Chance: what a gulf separates the field of 
ideas we associate with these terms in 1892, from that which was their ‘ordinary’ 
or ‘customary’ value some century ago, i.e. in the days of the French Revolution! 
Or, again, how different is the modern scientific use of the words ‘nature’ and 
‘law’ from the sense often to-day put upon them in popular or current language!
Indeed I am inclined to think that the irritating person, who insists in everyday 
life on definitions, is after all rather a social blessing than a social nuisance—for 
in my experience 90% of the wordy discussions which arise in ordinary life are
due to the fact that the disputants have not first fixed the sense in which they 
are using some fundamental conception. 

In the present course of lectures, which will deal with the theory of probability, 
with chance, luck and the vexed question of the scientific measurement of belief,
we shall have to be especially careful that we clearly define and appreciate our
fundamental conceptions. This insistence on definition must be the starting-point 
of any really scientific discussion, and I want to urge you all to start the study 
of this or any other subject by trying to clearly define its scope and terms. You 
must in this respect ‘list to what the friar preaches and not to what he does’, for 
according to a sharp-eyed reviewer I have myself been guilty of publishing a 
book, in which no definition of chance itself was given! I will endeavour to 
supply that omission in to-day’s lecture. But first I want to point out the rela- 
tion between the subject of my present course and the topics of the two earlier 
ones on the Fundamental Concepts of Science and on Statistics. The relationship
is a very close one indeed, far closer than might be imagined on a cursory 
examination. We shall find that statistics are the practical basis of much, if not
all, scientific knowledge, while the theory of chance is not only based on past
statistics but enables us to calculate the future from the past, the very essence of 
scientific knowledge. There is a close relation between provable and probable; the 
analysis of 20,000 tosses of a coin will help us to penetrate into the very laboratory 
of Nature whose complexity presents us with results strikingly akin to those of 
a game of chance; while the record of a month’s roulette playing at Monte Carlo 
can afford us material for discussing the foundations of knowledge. That things 
apparently so diverse should be so closely related may strike some of you as 
paradoxical, and indeed the ground we are to venture upon abounds in diffi- 
culties and pitfalls. It is one where criticism and controversy have been very 
rife, but at the same time have been fruitful of results and have contributed to
clearness of thought. We shall find well-marked divergencies of opinion, charac- 
terizing two different schools, which push to extremes in different directions. 
Based on the researches of Laplace and Quetelet, we find De Morgan, John
Stuart Mill and Stanley Jevons pushing the possibilities of the theory of probability 
in too wide and unguarded a manner; while in the opposite camp we find George 
Boole and Dr Venn taking a severely critical and in some respects perhaps too 
narrow view of them. As in many other cases the safe road is probably the middle 
road, and this road is that which I conceive Prof. Edgeworth of Oxford to have 
pointed out. For those of you who may have time for reading I would strongly 
recommend a comparison of Chaps. x-xii of Stanley Jevons’ Principles of
Science with Chaps. vi-xi of Dr Venn’s Logic of Chance and Prof. Edgeworth’s
Philosophy of Chance published in Mind for 1884. I shall refer to the opinions 
of these writers in the course of our work, but you would find the subject of 
chance as treated by them enticing in the extreme, and they will give you far 
more amply than I can do in these lectures the various features of the controversy. 
While dealing with the subject of books I may also refer to: 
DE MORGAN: Formal Logic (1857). Here Chaps. ix-xi are closely connected
with the topics of our first two lectures.

DE MORGAN: An Essay on Probabilities (1838). This is still a useful and suggestive
little book, although it requires some mathematical knowledge.

WHITWORTH: Choice and Chance (3rd ed. 1878). An excellent book with which
to approach the elements of the mathematical theory.

WESTERGAARD: Die Grundzüge der Theorie der Statistik (1890). By far the best
textbook on the relation of statistics and probability for those who read 
German. 

Now I want to restate in the first place some of the conclusions I placed 
before you in my first course of Gresham Lectures. I do not want you now—any 
more than I did then—to accept those conclusions as your own but rather to 
probe and investigate them for yourselves, and thus ascertain whether they 
form a basis sufficiently sound for the superstructure placed upon them. The 
conclusions to which I want to draw your attention are those concerning the 
material of science, scientific law and cause and effect. In the first place the 
material of science consists of certain groups of sense-impressions, which we 
term phenomena and in which we mark not only a certain permanency but a 
routine. When we find a certain sequence of sense-impressions frequently re- 
peating itself, we speak of any antecedent sense-impression as a cause, any 
subsequent one as an effect. A, B, C, D, E, F being a succession of sense- 
impressions, which repeats itself, A, B, C, D, E are all termed causes of the effect 
F. A scientific law or formula is a statement which enables us to resume or 
describe in brief language a routine sequence—or many such routine sequences— 
of causes and effects. As I pointed out to you, a scientific law does not enforce a 
sequence, it merely describes what takes place. No law of gases causes or enforces 
the boiling of a kettle of water, when placed on the fire; it merely describes how 
it boils. How then do we know that a kettle of water will boil, if placed on the 
fire? The answer is a very simple one: our knowledge of what will happen is
based upon past experience. Statistics of past experience, our own or that of 
other men, are the basis of our knowledge of all cause and effect, of all know- 
ledge of phenomena. Here you have the kernel to statistics as the basis of 
knowledge. The statistics are formed in a rough practical manner, but are none 
the less real for all that. Take any sequence of phenomena such as a kettle of 
water boiling which has been long enough on the fire. We have behind us the 
invariable experience that kettles in like positions do boil, and we say we know 
that this kettle will boil. If it does not we expect that some portion of the 
customary sequence fails, the fire has gone out, there is no water in the kettle, 
or there is somewhere a breach in the ‘group of causes’. But in our statement 
about the kettle there are really two important factors, there are the statistics 
of past experience, and the assumption that these statistics will apply to the 
future. There is no scientific reason why the same groups of causes should always
be followed by the same effect. Indeed, a distinguished American mathematician,
Mr C. Peirce, has gone so far as to support the view that the causes,
A, B, C, D, E, may be followed by F or G, indifferently. There is no logical or 
intellectual proof that like causes will be followed by like effects. It is purely a
result of experience. Statistics show us the prevalency of routine in the past, and 
these statistics are the first basis of our knowledge. That what has held in the 
past, will hold for the future, is again a statement for which there is no proof; 
it is the outcome of our experience of what has happened in the pasts, which 
were at an earlier date futures. Hence our inferences with regard to natural 
phenomena are essentially based on statistics—namely statistics of what has 
happened in the past, and the experience that within certain ranges the statistics 
of the past repeat themselves in the future. Now I want you to grasp this point 
very clearly, for we are coming close to the relationship between statistics, 
knowledge, belief and chance. What do we mean when we say that 106 boys are 
born as compared with 100 girls? Or when we assert that such will take place 
next year? Why, simply this: that the statistics of past years for a very great
number of births give us boys and girls repeatedly in these proportions, and 
further experience—in other words statistics again—shows us that such 
statistical ratios do not change suddenly and abruptly; the results calculated for
a period of four or five years hold very closely for the following four or five
years. Or, again, when we say that we know that the sun will rise to-morrow 
we are just as much appealing to past experience of the action of the sun and 
past experience of the occurrence of routine, as when we appeal to the statistical 
appearance of births. The law of gravitation does not enforce the rising of the 
sun, it is merely a scientific description of what we observe in the motion of the 
planets. Suppose the sun had not risen on one well-authenticated occasion in 
our experience, and on one only, we should then be slightly less confident in our 
assertions as to its appearance to-morrow. Our knowledge would then have been 
weakened down into some very strong form of belief. The more frequently the 
sun had omitted to rise, the less strong would be our certainty with regard to its 
conduct to-morrow, until we passed through every shade of belief to disbelief 
itself. Or let us take a more tangible and possible case. A friend is leaving us, 
say in Chancery Lane at 4 o’clock in the afternoon, and we tell him that he will
find a Hansom cab at the Fleet Street corner. There is no hesitation in our 
assertion. We speak with knowledge, because an invariable experience has shown 
us Hansom cabs at 4 o’clock in Fleet Street. But given the like conditions 
within reach of a suburban cab-stand, and our statement becomes less definite. 
We hesitate to say absolutely that there will be a cab: ‘You are sure to find a cab’,
‘I believe there will be a cab on the stand’, ‘There is likely to be a cab on the 
stand’, ‘There will possibly be a cab on the stand’, ‘There might perhaps be a 
cab’, ‘I don’t expect there’ll be a cab’, ‘It’s very improbable’, ‘You are sure not
to find a cab’, etc., etc. In each and every case we go through some rough kind
of statistics: once we remember to have seen the stand without a cab; on occasions
few and far between, ‘perhaps on an average once a month’, ‘perhaps once
a week’, ‘every other day’, ‘more often than not there has been no cab there’. 
Certainty in the case of Fleet Street passes through every phase of belief to
disbelief in the case of the suburban cab-stand. If once a month is the very
maximum of times I have seen an empty cab-stand, my belief that my friend 
will find a cab there to-day is far stronger than if I have seen it vacant once a 
week. A measure of my belief in the occurrence of some event in the future is 
thus based upon my statistical experience of its occurrence or failure in the past. 
When in a wide range of experience there has been no experience of failure, then,
as in the case of the cab in Fleet Street, or in the case of the sun rising to-morrow,
our belief becomes so strong that we speak of knowing. But all this knowing
really amounts to is a very high, or even the highest possible, degree of probability. 
I know that the three angles of a triangle together make two right angles, for this 
lies in my definition of triangle, and belongs to the field of mental conceptions 
and not to physical phenomena. But of the physical universe I can only say I 
believe such and such things will occur, and the degree of my belief is measured 
in a rough approximate way by the statistics of past occurrence and failure. 

We can see this better, I think, by returning to the definite case of the cab 
on the stand. Once a week on the average of a long experience I have seen the 
stand empty. Thus for every six occasions there is a cab, there is one occasion 
that there is not a cab. Had I sought for a cab at the given hour for a long 
period I should have been successful six times in every seven. We then define 
the ratio of the number of successful instances to the total number of occasions 
as the probability or chance of finding a cab—in this case the chance is 6/7. Thus 
the chance of an event is the numerical measure of past experience. It is based 
essentially on statistical information. How wide must be the range of information 
on which the chance is based we will consider later, for it involves very many 
important points. At present we have the following simple rule: Taking the 
statistics of the occurrence, find the number of favourable instances and divide 
them by the total number of instances and this is the chance of the event. 
Returning to our cab-stand, suppose that only once in four weeks I have seen 
it empty at 4 o’clock on the average, then the chance of the event, finding a cab 
at 4 o’clock, is 27/28, i.e. in the long run 27 favourable instances per 28
occurrences. 
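
The rule just stated can be put into a few lines of Python. This is only a minimal sketch of the arithmetic, not part of the lecture: the function name `chance` and the use of exact fractions are assumptions made for illustration, and the two cab-stand cases are those given in the text.

```python
from fractions import Fraction


def chance(favourable: int, total: int) -> Fraction:
    """Chance of an event: favourable instances divided by all instances."""
    if total <= 0 or not 0 <= favourable <= total:
        raise ValueError("need total > 0 and 0 <= favourable <= total")
    return Fraction(favourable, total)


# Stand seen empty once a week: 6 successes in every 7 visits.
weekly = chance(6, 7)
# Stand seen empty once in four weeks: 27 successes in every 28 visits.
four_weekly = chance(27, 28)

print(weekly, four_weekly)  # 6/7 27/28
```

Exact fractions are used here rather than decimals so that the result keeps the form in which the text states it.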

Now, if I have found only one failure in 28 occurrences my hope of finding 
a cab on a particular occasion—my belief in there being a cab—will be far 
greater than if I have found a failure once in 7 occasions. Thus my belief is in
some way related to the chance; if I know the chance is greater, my belief is 
greater. Prof. De Morgan has asserted that the proper measure of belief is
chance, and according to him my belief in the two cases cited above would be 
as 6/7 to 27/28, or as 8 to 9. He thus reaches an exact numerical appreciation of 
belief, what might be termed a scientific measurement of belief. 
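
The proportion of 8 to 9 can be checked directly. The following is a short sketch under the assumption that De Morgan's measure is simply the ratio of the two chances; the helper name `belief_ratio` is hypothetical and is not drawn from De Morgan.

```python
from fractions import Fraction


def belief_ratio(chance_a: Fraction, chance_b: Fraction) -> Fraction:
    """Ratio of two beliefs, taking belief to be measured by chance."""
    return chance_a / chance_b


once_a_week = Fraction(6, 7)          # cab found 6 times in 7
once_in_four_weeks = Fraction(27, 28)  # cab found 27 times in 28

ratio = belief_ratio(once_a_week, once_in_four_weeks)
print(ratio)  # 8/9
```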

This view of De Morgan’s has been severely criticized by Dr Venn. He asserts 
that chance is something objective or physical, being based on statistics of the
occurrence of physical phenomena, while belief is something psychical and is
largely determined by the emotional and nervous temperament of the individual 
man. In other words it is subjective and not objective. According to Dr Venn 
probability deals with the laws of things, while according to De Morgan prob- 
ability has to do with the laws of our thought about things. 

Now I think we must agree with Dr Venn that it is impossible to set an 
absolute numerical value upon the beliefs of human beings in practical life. No 
one will venture to say that one of his beliefs is exactly nine times as strong as 
another. Perhaps the only practical measure we can form of the strength of 
beliefs is the readiness of men to act upon them, and the impetuous or credulous 
man will risk as much where the chance is small, as the sluggish or sagacious 
man where the chance is large. At an important crisis he will risk finding a cab 
on a chance which would have induced the prudent man to order one beforehand. 
Clearly then as applied to the beliefs of practical men in actual life, Dr Venn is 
right in asserting against De Morgan that we cannot put an exact numerical 
value on belief. On the other hand I think we must question whether chance 
can conveniently be treated as peculiar to things. The means by which statistics 
are taken in practical life are human and they become subjective and individual 
in the process of taking and applying them. Besides this the statistics on which 
the chance may be reckoned are frequently at the option of the particular indi- 
vidual and the chance at once becomes subjective and peculiar to him. Let me 
point out what I mean. A man is tossing a coin in a railway carriage; a country
lad in the carriage, judging by his experience of coins in the past, is ready to believe 
that the chance of a head is 1/2, i.e. that once in two occasions in the long run it 
will come down head. A scientific man (also without guile!) who has made 
experiments in tossing coins knows that every coin has a slight bias, and that 
there is in all probability a slight fraction of a per cent more heads or tails in the 
long run in the tossing of this particular coin. A man of the world knows that 
the coin-tosser is a swindler, and judges that his coin is loaded, so that the 
chance that it comes down head is very far from a half. And the coin-tosser 
himself? Well, he has no statistics at all of the conduct of this particular coin— 
it may be true or biased or loaded—but being an adept in tossing he can bring 
it down head or tail as he pleases. What are we to say is the chance that this 
coin will come down head? We have no statistics whatever of what happens 
when swindlers toss coins, which may after all unknown to them be loaded! 
Are we to say that the chance is an unknowable quantity, and that we cannot 
make any application of the theory of probability? I am inclined to think this 
would unduly narrow the field of our science. It seems to me that we can and 
should apply our theory to the chances subjectively estimated of each occupant 
of the railway carriage. These chances are based on the subjective experience of 
each individual with regard to coins under like conditions, and they certainly 
are more concerned with the laws under which people think about things, than 
with the laws of things themselves. If the country lad bets on a head the chance
of a head is for him one-half, for the scientific gentleman it must be a shade less 
or a shade more, for the man of the world it is a very small chance indeed, for 
the swindler it may be a certainty, if he means the lad to win on the first occasion, 
in order to excite him to betting heavier amounts. Now the beliefs of these four 
persons clearly differ in strength and their relative proportions are closely re- 
lated to the individual measurement of the chance, to what we may term the 
subjective chance. I am inclined to think with De Morgan that belief varies very 
closely with the subjective chance, but this subjective chance depends upon the
statistics of individual experience, and may differ widely from what we may 
term the objective chance, or the chance based upon statistics of the actual event 
in question and independent of the individual calculator. 

Turn (I hope, for the last time) to our cab-stand and the chance of finding a 
cab on it. Accurate statistics may have been taken of the absence of cabs upon 
it for a long period, perhaps two or three years. For our present purposes these
may represent the statistics for the calculation of the ‘objective chance’. I may 
have observed the cab-stand, not very regularly, for a few months, and my result 
is: empty about once a week at 4 o’clock; a friend knows nothing about this 
particular cab-stand, but has formed statistics of suburban cab-stands in general;
while another person without paying special attention to suburban cabs has 
formed pretty precise ideas as to cab-stands in London as a whole. The statistics 
of suburban cab-stands in particular, or of London cab-stands in general may 
be wide and accurate, or may be individual and approximate; in either case it is 
a subjective act which classes the particular cab-stand under either of these
headings; the particular chance selected is the result of individual experience or
subjective choice. If we ask what is the relation between subjective chance and 
objective chance, I think we can safely say, that while the two often differ 
widely, yet the more deep a man’s experience, the more thorough his observation 
and his knowledge of phenomena, the more closely his subjective statistics will 
fit the objective statistics. He will never, perhaps, make the two coincide, but 
in the long run of practical life his mistakes will be few and tend to balance each 
other; his subjective chance will approximate to the objective chance in Dr 
Venn’s sense. He will know what classes of statistics to apply to individual cases 
with the best results, or in ordinary language, ‘he will be a judge of men and 
things’. If experience of life and acquaintance with fact lead a man’s subjective 
appreciation of chance to approximate to the objective value of chance, may we 
not say that, if belief varies with a man’s subjective view of chance, then 
ultimately it is objective chance which governs belief? 

There is a difficulty here which is, I think, sometimes overlooked, but which
seems to me fundamental and we must regard it with a little care. I take a coin 
and I say the chance, when it is tossed, of a head is one-half. Now what exactly 
does this mean? One or other of two things, either: 

(1) I have tossed this same coin 10,000 or 20,000 times and found practically the
same number of heads and tails. Here the subjective and objective chances are 
practically identical. Or: 

(2) I have not tossed this special coin at all, but judge it to be like other 
coins, of which my own rough experience, and that of other men, presents 
practical statistics of the equality in the number of heads and tails obtained in 
a great number of tosses. 

Here the subjective and objective chances may or may not be the same, for 
after all the coin may be loaded or even a double-headed one. But in this case 
also experience of the coin will ultimately bring the subjective and objective 
chances to the same value, be it 1/2 or otherwise. Now let us go a stage further 
and suppose that experience has brought the subjective appreciation of chance 
to its objective value. Would that objective value be a measure of my belief? 
Now there is an assumption here, which I have before referred to, and want you 
now to particularly notice. The chance is really based on past statistics: it is the
number of successes observed divided by the total number of trials. 20,000 times the
coin has been tossed and 10,000 times (within a few units, perhaps) heads
have appeared; the chance of head is 10,000/20,000 or 1/2. But this is not all we mean
when we say the chance is a half. We refer to the future as well as to the past,
and we assume that if we were to throw the coin an indefinite number of further 
times, there would be in the long run as many heads as tails. Here is the 
assumption we make when chance is taken as the basis of belief as to the future. 
In other words the statistics of past experience are assumed to be identical with 
the statistics of what will happen in the future. When I say that the chance of a 
head is one-half, that statement is meaningless if it be considered as referring
to a single toss; it refers to what I believe will happen on the average in a very
great number of future throws—i.e. a practical equality of heads and tails. This 
belief is based on two elements, first, statistics of past experience as to tossing 
and secondly the permanence of statistical ratios. 
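
The two elements named above may be illustrated by a small simulation. This is a sketch only: it assumes a true coin (chance of head exactly one-half) and an arbitrary fixed seed, and the names `relative_frequency`, `past` and `future` are hypothetical. The first 20,000 simulated tosses stand for past statistics, the second 20,000 for the future to which the chance is applied.

```python
import random


def relative_frequency(tosses):
    """Fraction of tosses that came down head (True)."""
    return sum(tosses) / len(tosses)


rng = random.Random(1892)  # fixed seed, an arbitrary choice, for repeatability
p_head = 0.5               # assumption: a true coin with a permanent ratio

past = [rng.random() < p_head for _ in range(20_000)]
future = [rng.random() < p_head for _ in range(20_000)]

past_chance = relative_frequency(past)        # chance drawn from past statistics
future_frequency = relative_frequency(future)  # what the future actually yields

print(f"past: {past_chance:.4f}  future: {future_frequency:.4f}")
```

The close agreement of the two figures follows only because `p_head` was held fixed between the two runs; that fixity is exactly the permanence of the statistical ratio which the text treats as an assumption.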

This latter is a most important element and one which in reality forms a 
large factor of belief. Let us bring this out more clearly by a comparison of one 
or two cases. Statistics of tosses show a coin to be a true coin, to give in the long 
run head as often as tail—chance of a head therefore 1/2. 

Statistics of a certain country show that of 206 births 106 are boys—chance 
of a boy being born 106/206. My statistical experience of a certain cab-stand, 
shows that on an average there is no cab there once a week at 4 o’clock—chance 
of a cab 6/7. 
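
The three chances just cited may be set side by side. This is a minimal sketch using only the figures given in the text; the dictionary name `chances` is a hypothetical label.

```python
from fractions import Fraction

# Favourable instances over total instances, as stated in the text.
chances = {
    "head": Fraction(1, 2),     # true coin
    "boy": Fraction(106, 206),  # 106 boys in 206 births
    "cab": Fraction(6, 7),      # stand empty once a week
}

for event, value in chances.items():
    print(f"{event}: {value} = {float(value):.4f}")
# head: 1/2 = 0.5000
# boy: 53/103 = 0.5146
# cab: 6/7 = 0.8571
```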

Now before I apportion my belief of what will happen in the long run in the 
future in these several cases, I have to consider the permanence of these statistical 
numbers, and the only way I can do this is by examining statistics as to the 
permanence of similar numbers. The factor of my belief depending on the 
permanency of the chances is itself rooted in statistics. How often have I found
the chance given by statistics to be constant, how often to change? What indeed
is the ‘chance’ of a chance changing? 

A coin has given as many heads as tails in the past, why should it not now 
begin to give a vastly greater proportion of heads? The appeal is again to 
experience, and experience tells us that, if a coin be not battered, bent or altered, 
it maintains indefinitely the same chance of a head. 

On the other hand experience tells us that while in vital statistics there is 
scarcely ever an abrupt change, ratios do alter slowly and gradually; the chance
of a boy being born as calculated from the last few years may hold for the next 
few, but it may vary from decade to decade and century to century. 

Still more may the chance of finding a cab on the stand vary. I may have 
carried out my observations for two or three months, but the completion of a 
new line of railway or a Licensing Act may make a sudden breach of continuity;
there is much less permanence in statistics of this kind, than in those of coins 
or babies. Clearly the chance determined from past statistics is not the only 
factor in apportioning my convictions as to the future appearance of heads, boy 
babies, and cabs. The chance of the statistics remaining in the future what they 
have been in the past must also be considered and be shown to be the same in all 
the cases where beliefs are compared. 
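
The effect of permanence, or the want of it, can be sketched numerically. Everything in the following is an assumption made for illustration: the future chances for births and for the cab-stand are invented figures, the seed is arbitrary, and `observed_chance` is a hypothetical helper. The sketch compares the chance reckoned from a past run of observations with the frequency found in a later run.

```python
import random


def observed_chance(p: float, n: int, rng: random.Random) -> float:
    """Relative frequency of success in n trials whose true chance is p."""
    return sum(rng.random() < p for _ in range(n)) / n


rng = random.Random(7)  # fixed seed, an arbitrary choice
n = 5_000

# (true chance in the past, true chance in the future); future values are hypothetical.
cases = {
    "coin (permanent)": (0.5, 0.5),
    "births (slow drift)": (106 / 206, 0.512),
    "cab-stand (abrupt breach)": (6 / 7, 0.60),
}

gaps = {}
for name, (p_past, p_future) in cases.items():
    past = observed_chance(p_past, n, rng)
    future = observed_chance(p_future, n, rng)
    gaps[name] = abs(past - future)
    print(f"{name}: past {past:.3f}, future {future:.3f}, gap {gaps[name]:.3f}")
```

Under these assumed figures the past chance of the coin serves well for the future, while that of the cab-stand fails badly, although each was reckoned from the same number of past observations.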

Thus, I think, we must agree with Dr Venn, although partly on other grounds, 
in recognizing that the chance of an event is not an accurate numerical measure 
of our belief in its occurrence, but on the other hand we may go so far with De 
Morgan as to assert that our belief is strengthened or weakened when the sub- 
jective chance based on our personal knowledge or experience—or on the rough 
and ready statistics of practical life—is increased or decreased. 

We may even go a stage further and construct a model universe in the 
following manner: Suppose a world in which men had such width of experience 
that their subjective appreciation of chance was equal to its objective value, and 
further that in this ideal world statistical ratios retained a permanent value— 
in the manner in which we actually find they do in games of chance—then in 
such a scientific ideal world chance might fairly be considered to measure belief. 

It may be asked what is the use of such an ideal model as this? In the flesh 
and blood men of actual life with their prejudices and half-knowledges the sub- 
jective appreciation of chance diverges often widely from its objective value; 
further in this real world few statistical ratios are actually permanent, they vary 
not only with time, but with the range and limits of our statistics. What then 
can the use of our model be? Well, of much the same use as the political econo- 
mists’ model of society governed by the laws of exchange or value, or the 
physicists’ molecular model of nature. Neither is true to reality, but both serve 
with certain reservations to describe in broad terms the general facts of economic 
and physical phenomena. In the same manner, because in a rough and approximate
way men’s subjective appreciation of chance does tend in the practical
experience of life to approach the objective value—and because in a great 
variety of cases chances calculated on past experience are found to remain 
permanent in the immediate future—so men’s beliefs as evidenced especially 
in conduct do vary with chance; and if chance be not a scientific measure of 
belief, it is yet in the average of men a rough and ready means of gauging, more 
or less accurately, the relative strength of convictions. 

It may seem strange to some of you to be told that chance is the measure of 
past experience. At first there may appear to be a very considerable difference 
between the chance that a boy or a girl will be born and the chance that a head 
or tail will turn up. We are quite ready to admit that statistics are needful in 
order to determine whether more boys or girls are born and so to determine the 
chance of a boy or girl birth. But we are not inclined at first to admit an equal 
necessity in the case of the coin. We are inclined to argue that ‘We see no reason 
why head should occur more frequently than tail’ and then convert this into 
‘There can be no reason why head should occur more frequently than tail’— 
and then ‘Head and tail must be equally frequent’. You will see the weakness 
of this argument at once by applying it to the case of boys and girls. ‘We see no 
reason why more boys should be born than girls.’ ‘There can therefore be no 
reason why more boys should be born than girls,’ and finally ‘No more boys are 
born than girls’. Here statistics step in and upset all our preconceived notions. 
In fact all arguments of this kind remind us of the old mediaeval notions of
physical science, which began by arguing as to what nature ought to do, instead 
of patiently observing what she did do. We may see no reason why head should 
occur rather than tail. But if there were not a very definite reason why head 
or tail should have the preference in each individual throw, then we may be 
quite sure that the coin would balance on its edge and exhibit neither head 
nor tail. 

Mere inspection of a coin would certainly not suffice to tell us that the 
‘chances’ of head and tail are equal. The head is different in shape and appear- 
ance from the tail, and the coin may really be biased by this. Let us get over 
this difficulty by taking a perfectly uniform disk absolutely alike on both sides, 
and let us imagine it thrown up so that neither side has any advantage either on 
leaving our hand or on reaching the ground. Can we say that the chances of 
either side are equal without any appeal to experience—to statistics? The fact 
is that if the two events were absolutely balanced in this manner, if not only we 
saw no reason why one should occur more than the other, but there was no 
reason, then there would be no chance of either event occurring at all. In our 
experience of nature there is no such thing as chance of this kind. The moment 
a coin or a die leaves the hand, its fate is really settled and there is no field for 
the ‘play of chance’ in the obscure sense we have just been referring to. The 
mechanical causes are perfectly definite and the occurrence of head or tail, 
ace or deuce, absolutely certain. It is quite true that these mechanical causes 
are far too complex, too evenly balanced and too incapable of measurement 
for us to mechanically describe what must happen, and so predict head or tail. 
Mechanically the one or other is predetermined, but the multiplicity of causes 
varying so slightly and yet so effectively from throw to throw leaves us in ignor- 
ance as to the result. If we are merely ignorant as to a result which is mechanically 
perfectly certain, what is the meaning of chance in physical nature? Simply this 
that we aid our ignorance by an appeal to past statistical experience. The chances 
of a coin falling head or tail being equal does not depend on my ignorance of what 
will occur, or on my seeing no reason why head more than tail should occur, but 
on my experience of the statistics of tossing coins. This experience is really summed 
up in the symbolic slang ‘a toss up’—as an expression for an equality of chances. 
I know from my own personal experience and from the common habits of men— 
as well as from the statements of gamblers and others—that loaded coins are 
not of frequent occurrence. Without this experience I could predict nothing of 
the tossing of a coin, it might invariably come down 99 % head; or having fallen 
on the first occasion head or tail, that fall might in itself determine what it 
would do on the second occasion. 
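
The point can be illustrated with a small simulation. This is only a sketch: the three coins, their probabilities, the seed and the function names are assumptions made up for the illustration and are not taken from the lecture. One coin is fair, one comes down head 99 times in 100, and one tends to repeat its previous fall. Nothing but the tally of many tosses tells them apart.

```python
import random

def toss_fair(rng, previous):
    """A coin with equal chances, independent of the previous fall."""
    return "H" if rng.random() < 0.5 else "T"

def toss_loaded(rng, previous):
    """A hypothetical loaded coin that falls head 99 times in 100."""
    return "H" if rng.random() < 0.99 else "T"

def toss_sticky(rng, previous):
    """A hypothetical coin that repeats its previous fall 9 times in 10."""
    if previous is None:
        return "H" if rng.random() < 0.5 else "T"
    if rng.random() < 0.9:
        return previous
    return "T" if previous == "H" else "H"

def tally(toss, n, seed=1892):
    """Return (share of heads, share of tosses repeating the previous fall)."""
    rng = random.Random(seed)
    previous, heads, repeats = None, 0, 0
    for _ in range(n):
        face = toss(rng, previous)
        heads += face == "H"
        repeats += face == previous
        previous = face
    return heads / n, repeats / (n - 1)

for name, toss in [("fair", toss_fair), ("loaded", toss_loaded), ("sticky", toss_sticky)]:
    share_heads, share_repeats = tally(toss, 20000)
    print(f"{name:7s} heads {share_heads:.3f}  repeats {share_repeats:.3f}")
```

In this sketch the sticky coin shows heads about as often as the fair one, so the dependence only appears when the repeats are counted as well. That is a further reason for appealing to the record of tosses rather than to the look of the coin.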

What I have said of tossing a coin holds good for the drawing of black and 
white balls out of a bag. It might seem at first sight that if 50 white and 50 
black balls were put into a bag and well mixed, then, each ball being replaced 
after the drawing, as many white as black balls will be drawn in a large number 
of trials. In other words the chances of drawing white and black balls are equal. 
But here again, if our statement is really to mean anything we must be appealing 
to some rough experience of the conduct of balls in bags. It is conceivable that 
the hand might have a preference for black balls, or that white balls would have 
a preference for each other and the bottom of the bag. If it be objected that the 
hand does not detect colour difference, and that gravity acts equally on equal 
balls if they be of different colours, we are at once appealing to a wide statistical 
experience resumed in certain fundamental laws of nature. We have left at once 
the shaky ground of subjective reasoning, and turned to statistics. 

But even in these cases direct experiment comes to our aid and provides the 
statistics, which are only roughly embodied in the everyday experience and 
opinions of mankind. Thus: 

BUFFON tossed a coin 4040 times, there resulted: 1992 heads, 2048 tails, or 
49% heads, 51% tails. 

QUETELET made 4096 drawings out of a bag containing an equal number of 
black and white balls, there resulted: white balls 2066, black balls 2030, or 
50.4% white and 49.6% black. 

Mr GRIFFITH, one of my students, has kindly tossed a penny 8178 times and 
there resulted: 4092 heads, 4086 tails, or 50.04% heads and 49.96% tails. 

WESTERGAARD made 10,000 drawings out of a bag containing equal numbers 
of red and white balls well shaken before each drawing after the replacement of 
the previously drawn ball. He obtained: white balls 5011, red balls 4989, or 
50.11% white and 49.89% red. 

YOUR PRESENT LECTURER tossed (as a holiday task) a shilling 24,000 times. 
He obtained for the first 12,000 tosses: 5981 heads, 6019 tails, or 49.84% heads, 
50.16% tails; second 12,000 tosses: 5992 heads, 6008 tails, or 49.933% heads, 
50.067% tails. In both series there is a balance in favour of tails: in the first 
of less than 1/6%; in the second of about 1/15%. 

Taking both series together we have: 24,000 tosses: 11,973 heads, 12,027 
tails, or 49.8875% heads and 50.1125% tails. 

To avoid any chance of there being a slight loading in the coin—a very slight 
bias towards tails—let us call the heads, tails and tails, heads in the first 
12,000 tosses, we then find: 12,011 heads and 11,989 tails; or 50.046% heads and 
49.954% tails. 
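
The relabelling just described can be checked by direct arithmetic. The sketch below uses only the counts quoted above. The helper names (`relabel`, `combine`, `percentages`) are made up for the illustration.

```python
# Counts as reported for the two series of 12,000 tosses.
first = {"heads": 5981, "tails": 6019}
second = {"heads": 5992, "tails": 6008}

def relabel(series):
    """Swap the names of the two faces, leaving the counts untouched."""
    return {"heads": series["tails"], "tails": series["heads"]}

def combine(a, b):
    """Add two series face by face."""
    return {face: a[face] + b[face] for face in ("heads", "tails")}

def percentages(series):
    """Express each face as a percentage of all tosses in the series."""
    total = series["heads"] + series["tails"]
    return {face: 100 * count / total for face, count in series.items()}

pooled = combine(first, second)                # both series as recorded
relabelled = combine(relabel(first), second)   # faces swapped in the first series only

print(pooled, percentages(pooled))
print(relabelled, percentages(relabelled))
```

Swapping the names in one series turns that series' excess of tails into an excess of heads. A constant loading of the coin would therefore largely cancel between the two halves of the pooled record.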

Thus to 1/20 of a per cent heads and tails are equal, or to express it in another 
manner there has on the average been only one head too many in 1200 tosses. 
Buffon’s experiments coincide with mine in showing a slight bias in favour of 
tail. 

Finally I have analysed the red and black events in an entire month’s play of 
the roulette tables at Monte Carlo. I find that out of 16,178 throws of the ball* 
8111 fell into a red number and 8067 into a black, or there were 50.14% red 
and 49.86% black. 
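
For convenience, the percentages quoted for the several experiments can be recomputed from the raw counts. This is a minimal sketch: the counts are those given in the text, while the table layout and the function name `percent_split` are assumptions of the illustration.

```python
# (experiment, first outcome, count, second outcome, count), as quoted in the lecture.
EXPERIMENTS = [
    ("Buffon, coin", "heads", 1992, "tails", 2048),
    ("Quetelet, balls", "white", 2066, "black", 2030),
    ("Griffith, penny", "heads", 4092, "tails", 4086),
    ("Westergaard, balls", "white", 5011, "red", 4989),
    ("Pearson, shilling", "heads", 11973, "tails", 12027),
    ("Monte Carlo, roulette", "red", 8111, "black", 8067),
]

def percent_split(first, second):
    """Return the two counts as percentages of their total."""
    total = first + second
    return 100 * first / total, 100 * second / total

for name, a_label, a, b_label, b in EXPERIMENTS:
    pa, pb = percent_split(a, b)
    print(f"{name:22s} n={a + b:6d}  {a_label} {pa:6.2f}%  {b_label} {pb:6.2f}%")
```

Each pair of counts adds up to the number of trials stated in the text, and in every experiment the two outcomes stand within about one per cent of equality.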

These experiments amply confirm the rougher statistical experience of man- 
kind as to the equality of chances in tossing, or drawing balls from bags, or 
playing roulette. It is on experience of this kind, on accurate statistical measure- 
ment, not on a priori reasoning or subjective opinion, that the data of probability 
are to be based. 

In my next lecture I shall deal more at length with the nature of the statistics 
by which we supplement our ignorance of what is about to happen. 


* The thirty-seventh figure, 0, was of course omitted to equalize chances. 







MEDICAL STATISTICS FROM GRAUNT TO FARR 
By MAJOR GREENWOOD 


INTRODUCTION 

Under the Fitzpatrick Trust, a Fellow of the Royal College of Physicians of 
London is chosen annually by the President and Censors to deliver two lectures 
in the College on ‘The History of Medicine’. I had the honour of being chosen 
for this office in 1940 but, for obvious reasons, the lectures were not delivered, 
and it may be safely assumed that some years will pass before a medical audience 
will have time to attend to the history of a subject the modern practice of which 
does not make a strong appeal to physicians. 

The nature of the intended audience inclined me to stress the medical rather 
than the purely statistical aspects of the story and I have trodden ground over 
which a greater man passed some years ago. I hope that Karl Pearson’s studies 
of some or all of these old heroes will eventually be printed, and I know that my 
slight essays can ill sustain a comparison. But, precisely because they are 
slight and linger over small traits and human oddities, they may, in these times, 
wile away an hour or two. I have eliminated some explanations which no 
statistician or biometrician needs and the medical technicalities are few. Perhaps 
a note on the London College of Physicians as it was in the days to which these 
studies relate should be added. 

The College was more than a century old when John Graunt was born, and 
the corporation consisted wholly of physicians who were Doctors of Medicine of 
Oxford or Cambridge; these were the Fellows. Physicians not Doctors of 
Medicine of Oxford or Cambridge were admissible only to the grade of Licentiate, 
and it was not until the nineteenth century, when Farr was a young man, that 
the exclusive privilege of the senior universities was abolished. It was not until 
Farr was a middle-aged man that the College had any direct contact with general 
practitioners of medicine and began to examine persons who did not seek to 
practise solely as physicians. In modern usage the College licence, L.R.C.P. 
(now only granted jointly with the membership of the Royal College of Surgeons, 
M.R.C.S.), is a diploma obtained by a large proportion of general medical 
practitioners in the South of England. Down to Farr’s time, the L.R.C.P. was a 
‘specialist’ diploma and could not have been taken by a general practitioner 
(the apothecary of those days) at all. The old L.R.C.P. is represented by the 
M.R.C.P. of our own time but with this distinction. Now, Fellows (F.R.C.P.) 
are normally chosen from the body of M.R.C.P.’s. In the past only Doctors of 
Medicine of Oxford or Cambridge could be Fellows, and before election but after 
examination were known as ‘candidates’, not licentiates. The great physician 
Sydenham was never more than a licentiate. He graduated M.B. at Oxford and, 
for some unknown reason, never proceeded M.D. until near the end of his life, 
when he took the higher degree not at Oxford but at Cambridge. 


I. THE LIVES OF PETTY AND GRAUNT 


It is always rash to assign an absolute beginning to any form of intellectual 
effort, to say that this or that man was the very first to fashion some organon 
which has proved valuable. All we are justified in saying is that this or that 
man’s work can be shown to have so directly influenced the thought of his con- 
temporaries or successors that from his day the method he used has never been 
forgotten. It may be that the lost works of the school of the Empirics Galen 
despised anticipated the numerical method of Louis—some words of Celsus are 
consistent with the hypothesis. It may be that in the long succession of parish 
clerks who for more than a century transcribed the London Bills of Mortality, 
one or two suggested that these figures might have some other use than that of 
warning His Highness of the need to move into Clean Air. But we do not know. 
We do know that out of the casual intercourse of two Englishmen in the seven- 
teenth century was produced a method of scientific investigation which has 
never ceased to be applied and has influenced for good or ill the thought of all 
mankind. In that sense at least we may fairly hold that John Graunt and 
William Petty were the pioneers not only of medical statistics and vital statistics 
but of the numerical method as applied to the phenomena of human society. 

John Graunt and William Petty were both of Hampshire stock. Petty was 
of Hampshire birth, born on Monday, 26 May 1623, and was three years younger 
than John Graunt, who was born at the Seven Stars in Birchin Lane on 24 April 
1620. 

Materials for writing Petty’s life are abundant; indeed a good biography of 
him was written nearly fifty years ago by his descendant Lord Edmond Fitz- 
maurice, and since then much of the material used by Lord Edmond has been 
printed. Sources for Graunt’s biography are scanty, the most valuable John 
Aubrey’s brief life of him.* Graunt and Petty became acquainted in or before 
1650. The circumstances of that first acquaintance are interesting to those who 
meditate upon the peripeteia of human fate. It was the contact of client and 
patron. 

John Graunt’s early life and manhood were those of the Industrious 
Apprentice. His father was a city tradesman, who bred his son to the profession 
of haberdasher of small wares. John ‘rose early in the morning to his study 
before shop-time’ and learned Latin and French, but did not neglect his business. 
He was free of the Drapers’ Company and went through the city offices as far as 


* Brief Lives, chiefly of Contemporaries, set down by John Aubrey, between the years 1669 and 
1696, edited by Andrew Clark, Oxford, 1896, 1, 271 et seq. 




common councilman; he was captain and then major of the trained bands (the 
ancestor of the Honourable Artillery Company). At the time of the Great Fire 
he is said to have been an opulent merchant. Even fifteen years earlier he—and 
no doubt his father (1592—1662)—had city influence. At that time a Gresham 
professorship was vacant and a young Dr Petty was anxious to obtain it. This 
young man’s career had been unlike that of an industrious apprentice; it had 
been, even for the seventeenth century, romantic. His father was a clothier in 
Romsey, who ‘did dye his owne cloathes’ in a small way of business. When 
William was a child, ‘his greatest delight was to be looking on the artificers— 
e.g. smyths, the watch-maker, carpenters, joyners etc.—and at twelve years old 
could have worked at any of these trades. Here he went to schoole, and learnt 
by 12 yeares a competent smattering of Latin, and was entred into Greek’ 
(Aubrey, Clark’s edition, 2, 140). 

But the precocious lad did not find a patron in Romsey and was shipped for 
a cabin boy at the age of fourteen. His short sight earned him a taste of the 
rope’s end, and after rather less than a year at sea he broke his leg and was set 
ashore in Caen to shift for himself. ‘Le petit matelot anglois qui parle latin et 
grec’ attracted sympathy and obtained instruction in Caen. Caen was not a 
famous seat of learning like Leyden or Montpellier, but the Fellows and 
licentiates of the College of Physicians admitted between 1640 and 1700 include 
the names of four persons who studied or graduated in Caen (Nicholas Lamy, 
Theophilus Garenciéres, John Peachi and Richard Griffiths). Petty, however, 
was not then thinking of medicine but mathematics and navigation and came 
home to join the navy. In what capacity he served is unknown; he merely says 
(in his Will) that his knowledge of arithmetic, geometry, astronomy conducing 
to navigation, etc., and his having been at the University of Caen, ‘preferred me 
to the King’s Navy where at the age of 20 years, I had gotten up about three 
score pounds, with as much mathematics as any of my age was known to have 
had’. His naval career was short, for in 1643 he was again on the continent. 
Here he wandered in the Netherlands and France and studied medicine or at 
least anatomy. He frequented the company of more eminent refugees, such as 
Pell and Hobbes, as well as that of the French mathematician Mersen. He was 
very poor and told Aubrey that he once lived for a week on three pennyworth of 
walnuts, but on his return to England the three score pounds had increased to 
seventy and he had also educated his brother Anthony. 

At first Petty seems to have tried to make a living out of his father’s business, 
but he soon went to London with a patented manifold letter writer and sundry 
other schemes of an educational character. These occupied him between 1643 
and 1649 and made him acquainted with various men of science, among others 
Wallis and Wilkins, but were not remunerative, and in 1649 he migrated to 
Oxford. 


Petty was created Doctor of Medicine on 7 March 1649 by virtue of a 
dispensation from the delegates (no doubt the parliamentary equivalent of the 
Royal Mandate of later and earlier times). He was also made a Fellow of 
Brasenose and had already been appointed deputy to the Professor of Anatomy. 
He was admitted a candidate of the College of Physicians in June 1650 (he was 
not elected a Fellow until 1655 and was admitted on 25 June 1658). At Oxford 
he became something of a popular hero by resuscitating (on 14 December 1651) 
an inefficiently hanged criminal, who, condemned for the murder of an illegiti- 
mate child, is said to have survived to be the mother of lawfully begotten 
offspring. 

Academically Petty rose to be full Professor of Anatomy and Vice-Principal 
of Brasenose. It is at this point (as usual the precise dates are dubious) that he 
became a candidate for a Gresham professorship and made contact with John 
Graunt. 

Although, as I have said, the materials for a biography of Petty are abundant, 
all we know of his early years comes from himself or from friends of later life 
who knew no more than he told them. We have no independent means of 
judging the extent of his culture. There is good evidence that he knew more 
Latin than most Fellows of the College of Physicians know now; none that he 
was an exact scholar (indeed we have his own word, which I am not prepared 
to gainsay,* to the contrary). He was certainly admitted to friendship by some 
men, such as Wallis and Pell, who were serious mathematicians, as by others, 
such as Hobbes, who were not. But whether he could fairly be called a mathe- 
matician is doubtful. Of his medical knowledge we know little. He left medical 
manuscripts, but these are still unpublished; of his clinical experience we know 
nothing. 

Petty told Aubrey that ‘he hath read but little, that is to say, not since 
25 aetat., and is of Mr. Hobbes his mind, that had he read much, as some men 
have, he had not known as much as he does, nor should have made such dis- 
coveries and improvements’. But it is at least certain that he made a favourable 
impression upon men who had read a good deal and that the young Dr Petty of 
1650 was thought a promising man. Still it had been an odd career and one 
wonders what a steady business man in the city of London thought of it. 

Why the anatomy professor who had resuscitated half-hanged Ann Green 
should be made a professor of music is not obvious, and if the Gresham appoint- 
ments were jobs, why should the job be done for Petty? The modern imaginative 
historian might suggest various reasons. For instance, that Petty made a 

* If No. 88 of The Petty Papers (2, 36) is a typical example of Petty’s Latin Prose style, there 
is not much to be said for it. Here is an example: ‘An dulcius est humanae naturae permultos 
suam potestatem in unum quendam et in perpetuum transferre, id est pendis amittere quam ipso 
puel deindem servare, vel paulatium et in breve tempus irogare, a seipsis demo reformendam et 
disponendam alioquin pro ut, mutato tam rerum quam animi indies suaserit?’ Some of the 
gibberish may be due to the editor’s failure to decipher the handwriting, but no emendation could 
twist this into unbarbaric prose. 




conquest of Graunt, perhaps had Hampshire friends who were friends of the 
Graunt family, perhaps talked about political arithmetic. We have no evidence 
at all. If the Gresham Professor of Music had duties, Petty did not perform them; 
about the time of his appointment he obtained leave of absence from Brasenose 
and within a year (in 1652) had left for Ireland, where he was to be very busy 
for some time to come and to make, or found, his material fortunes. 

Macaulay (chap. III) says that at the end of the Stuart period the greatest 
estates in the kingdom very little exceeded twenty thousand a year. 


The Duke of Ormond had twenty-two thousand a year. The Duke of Buckingham, 
before his extravagance had impaired his great property, had nineteen thousand six 
hundred a year. George Monk, Duke of Albemarle, who had been rewarded for his eminent 
services with immense grants of crown land, and who had been notorious both for covetous- 
ness and for parsimony, left fifteen thousand a year of real estate, and sixty thousand 
pounds in money, which probably yielded seven per cent. These three Dukes were supposed 
to be three of the very richest subjects in England. 


In 1685 Petty made his Will. This Will is a curiously interesting document, 
because it is also an autobiography. It is rich in arithmetical statements and, 
like much of Petty’s arithmetic, the statements may be optimistic. Petty’s final 
casting of his accounts is in this fashion: ‘Whereupon I say in gross, that my 
reall estate or income may be £6,500 per ann. my personall estate about £45,000, 
my bad and desparate debts, 30 thousand pounds, and the improvements may 
be £4000 per ann., in all £15,000 per ann. ut supra.’ 

The details of the calculation are perplexing enough; still if the above cited 
dukes were the richest subjects of the king and if (Macaulay) ‘the average income 
of a temporal peer was estimated by the best informed persons, at about three 
thousand a year’, Sir William Petty, of the year 1685, had travelled as far from 
the young Oxford professor of 1650 as that budding physician from the little 
English cabin boy who spoke Latin and Greek, in Caen, in 1638. The details of 
the fortune-building are not our concern. The shortest account is Petty’s own in 
his Will. He says that by the end of his Oxford career he had a stock of four 
hundred pounds and received an advance of one hundred more on setting out 
for Ireland. 


Upon the tenth of September, 1652, I landed att Waterford, in Ireland, Phisitian to the 
army, who had suppressed the Rebellion began in the year 1641, and to the Generall of the 
same, and the Head Quarters, at the rate of 20s. per diem, at which I continued, till June, 
1659, gaining by my practice about £400 per annum, above the said sallary. About 
September, 1654, I, perceiving that the admeasurement of the lands forfeited by the fore- 
mentioned Rebellion, and intended to regulate the satisfaction of the soldiers who had 
suppressed the same, was most insufficiently and absurdly managed, I obtained a contract, 
dated the 11th. of December, 1654, for making the said admeasurement, and by God’s 
blessing so performed the same as that I gained about nine thousand pounds thereby, which 
with the £500 above mentioned, my sallary of 20s. per diem, the benefit of my practice, 
together with £600 given me for directing an after survey of the adventrs lands, and £800 
more for 2 years sallary as Clerk of the Councell, raised me an estate of about thirteen thou- 
sand pounds in ready and reall money, at a time, when, without art, interest, or authority, 
men bought as much lands for 10s. in reall money as in this year, 1685, yield 10s. per ann. 
rent above his Maties quitt rents (The Life of Sir William Petty, by Lord Edmond Fitz- 
maurice, London 1895, p. 319). 


No one would willingly rake over the embers of Irish history—still glowing 
after nearly three hundred years. Petty believed himself to be a good man 
struggling against adversity and a public benefactor treated with gross injustice 
to the day of his death. Lecky (History of Ireland, vol. 1, chap. 1, p. 111 of 
popular edition) took a less favourable view. Even if the subject were relevant 
to my undertaking, which it is not, I have not the training in historical research 
to justify me in writing about it. There are, however, some points of psycho- 
logical interest. 

Petty did not, like his contemporary Thomas Sydenham, actually take up 
arms against the king, but he was even more plainly a protégé of the king’s 
enemies. Sydenham’s military career was unimportant; there is no reason to 
believe that he ever exchanged a word with a member of the Cromwell family. 
Petty was the confidential adviser and close personal friend of Henry Cromwell; 
his services to the Commonwealth authorities were the foundation of his fortune. 
Like many people who have social gifts he had the gentle art of making enemies. 

Pepys, Aubrey and Evelyn concur in the judgment that Petty was a most 
entertaining companion. Evelyn says he was a wonderful mimic. He could 
speak ‘now like a grave orthodox divine; then falling into the Presbyterian way; 
then to Fanatical, to Quaker, to Monk, and to Friar and to Popish Priest’. The 
gift he exercised among his friends. 

My Lord D. of Ormond once obtained it of him, and was almost ravished with admira- 
tion; but by and by he fell upon a serious reprimand of the faults and miscarriages of some 
Princes and Governors, which, though he named none, did so sensibly touch the Duke, 
who was then Lieutenant of Ireland, that he began to be very uneasy, and wished the spirit 
layed, which he had raised; for he was neither able to endure such truths, nor could but be 
delighted. At last he turned his discourse to a ridiculous subject, and come down from the 
joint-stool on which he had stood, but my lord would not have him preach any more 
(Evelyn). 

My lord Duke was not the first or last person to fail to relish a joke against 
himself. 

In The Londoners a challenged party names garden hoes as the weapons. 
That was Mr Robert Hichens’s fun. In real life, Petty, challenged to mortal 
combat by a Cromwellian soldier, pleaded his myopia and demanded that the 
duel should take place in a cellar and the weapons be axes. 

A man like this makes friends or at least admirers, also enemies. Long before 
the king enjoyed his own again, Petty had a host of enemies. When the king 
returned, one might have expected that Petty’s position would be critical. 
According to his own account he did lose something, but he was knighted and the 
losses, such as they were, did not seem to stay the growth of his fortune. At the 
Restoration he was already prosperous and he died wealthy. Perhaps the 
explanation is that Petty was really as great a public benefactor as he thought 
he was. Perhaps the reason is personal. King Charles loved wits (in the old and 
new sense of the word) and Petty was a wit. The scanty specimens of what 
Petty’s modern representative calls ‘Rabelaisian’ printed from the Petty papers 
would not have appealed to such a connoisseur in this genre as the king—we 
know from Halifax that the king liked to be the raconteur in this field and indeed 
repeated himself often—but he would have relished a good mimic. Still more 
important might have been their common virtuosity. 

Charles was interested in experimental science, and although Petty certainly 
knew more than the king, he may not have known very much more. Neither 
Charles nor James would have been able to find more common ground with 
Isaac Newton than in a later age Bonaparte found with Laplace. But the 
ingenious Dr Petty, who had resuscitated half-hanged Ann Green (which would 
be a capital story if well told), invented an unsinkable ship, had a dozen plans 
for doubling the king’s revenue, and knew something of everything, probably 
did more than Wilkins to interest the king in the new society of virtuosos (how 
the king must have relished the story of the planting of horns in Goa*), and he 
may incidentally have interested the king in his business affairs. This is all 
speculation; what is sure is that when Petty was back in London and able to 
renew personal intercourse with John Graunt, their relation was no longer that 
of client and patron. For a few years more, Graunt was to be a solid merchant, 
but before long Petty was the patron and Graunt the client. 

At this point it will be convenient to conclude the biographical facts relating 
to Graunt. I take them mainly from Aubrey. 

Graunt continued to be a prosperous city tradesman for many years after 
his first meeting with Petty. ‘He was’, says Aubrey, ‘a man generally beloved; 
a faithful friend. Often chosen for his prudence and justice to be an arbitrator; 
and he was a great peace-maker. He had an excellent working head, and was 
facetious and fluent in his conversation.’ Pepys thought as well of Graunt as did 
Aubrey, admiring both his conversation and his collection of prints—‘the best 
collection of anything almost that ever I saw’. 

From the Restoration for several years Graunt figures in London intellectual 
society (he was elected F.R.S. in 1663), but a material calamity was at hand. 
The Fire of 1666 no doubt caused Graunt direct financial loss; this might have 
been repaired. But, although brought up in Puritan ways, ‘he fell’, to quote 
Aubrey, ‘to buying and reading of the best Socinian bookes, and for severall 


* Sir Philiberto Vernatti, Resident in Batavia, had certain inquiries sent him by order of the 
Royal Society. The eighth question was: ‘What ground there may be for that Relation, concerning 
Horns taking root, and growing about Goa?’ This is Sir Philiberto’s answer: ‘ Inquiring about this, 
a friend laughed, and told me it was a jeer put upon the Portuguese, because the women of Goa are 
counted much given to lechery’ (Sprat’s History of the Royal Society of London, 2nd ed. London 
1702, p. 161). 




years continued of that opinion. At least, about ... he turned a Roman Catho- 
lique, of which religion he dyed a great zealot.’ 

Graunt’s path to Rome was similar to that of young Edward Gibbon, but 
the results on the career of a city tradesman in the days of Oates triumphans 
were more serious than a visit to Lausanne. Graunt became bankrupt. His 
name dropped out of the list of the Royal Society after 1666, and in 1674 he 
died. There is evidence that in these last years of worldly misfortune, when the 
wheel had come full circle since Graunt had secured the Gresham professorship 
for Petty, Petty helped Graunt. When Petty was in Ireland, Graunt acted in 
some sort as his London agent, and Petty conceived a plan of settling Graunt in 
Ireland. But (we have, of course, only Petty’s word for this) Graunt was not an 
easy man to help; it is possible, of course, that he may have resented Petty’s 
admonitions. ‘You have done amiss in sundry particulars, which I need not 
mention because you yourself may easily conjecture my meanings. However we 
leave these things to God and be mindful of what is the sum of all religion, and 
of what is and ever was true religion all the world over.’ This is an extract from 
a letter of January 1673 to Graunt (The Petty-Southwell Correspondence, p. xxix) 
printed by the late Marquis of Lansdowne. If Lord Lansdowne was right (the 
whole letter is not printed) in thinking this a reference to Graunt’s conversion 
(or perversion) ‘of which’, says Lord Lansdowne, ‘Petty seems to have dis- 
approved on temporal rather than spiritual grounds’, it might have hurt a 
sensitive man. 

Graunt died on Easter Eve 1674 and was buried the Wednesday following in 
St Dunstan’s church in Fleet Street. ‘A great number of ingeniose persons 
attended him to his grave. Among others, with teares, was that ingeniose great 
virtuoso, Sir William Petty, his old and intimate acquaintance, who was sometime 
a student of Brasenose College.’ Sir William outlived his friend thirteen years 
and lies in Romsey Abbey. Until a descendant in the nineteenth century (the 
third Marquis of Lansdowne) erected a monument, ‘not even an inscription 
indicated that the founder of political economy lay in Rumsey Abbey’ (Fitz- 
maurice, p. 315). 

Graunt had a son who died in Persia and a daughter who, according to 
Aubrey, became a nun at Ghent. Nothing is known of descendants. 

Petty’s widow was raised to the peerage and her elder sons, Charles and 
Henry, died without issue. But the title was revived in favour of the grandson 
of John Fitzmaurice, the second surviving son of Thomas Fitzmaurice, Earl of 
Kerry, who, as the above-mentioned grandson remarked, had ‘married luckily 
for me and mine, a very ugly woman who brought into his family whatever degree 
of sense may have appeared in it, or whatever wealth is likely to remain in it’. 
This ill-favoured woman was Petty’s daughter Anne, to whom her father wrote: 


My pretty little Pusling and my daughter Ann 
That shall bee a countesse, if her pappa can. 










The cynical grandson was George III’s prime minister and afterwards his bête 
noire, ‘The Jesuit of Berkeley Square’ and first Marquis of Lansdowne. 

Of the two friends, one has left an intellectual monument only; descendants
of the other have been famous in English history. 

Of these, best known are the first and third Marquises of Lansdowne, 
William (1737-1805) and Henry (1780-1863). Of the first marquis, much better 
known as Lord Shelburne (the title created for Lady Petty), every schoolboy— 
not only Macaulay’s schoolboy—has heard; the quarrel between Charles Fox 
and Shelburne, the party split, the coalition ministry and so on. Schoolboys who 
have reached the sixth and Lecky’s History of England in the Eighteenth Century, 
know a little more. Shelburne, who had much more than a tincture of his 
great-grandfather’s ability and applied himself to economic studies, was one of 
the earliest to appreciate the importance of Adam Smith and was highly thought 
of by two good judges of scientific ability, Benjamin Franklin and Jeremy 
Bentham. 

As a public man, no parliamentary statesman before or since obtained so 
universal a dislike, a positive hatred shared by those who knew him and those 
who did not. 


There is certainly nothing in the actions of Shelburne to justify this extreme un- 
popularity. Much of it was, I believe, simply due to an artificial, overstrained, and affectedly 
obsequious manner, but much also to certain faults of character, which it is not difficult to 
detect. Most of the portraits that were drawn of him concur in representing him as a harsh, 
cynical, and sarcastic judge of the motives of others; extremely suspicious; jealous and 
reserved in his dealings with his colleagues; accustomed to pursue tenaciously ends of his 
own, which he did not frankly communicate, and frequently passing from a language of 
great superciliousness and arrogance to a strain of profuse flattery (Lecky, 5, 136). 


How far some of these characteristics may be recognized in Shelburne’s 
ancestor, we shall inquire in due course. 

The contrast between Malagrida* and his son Henry is shattering. It is this 
Marquis of Lansdowne of whom nearly everybody thinks when he sees the title 
in a book, and rightly so. Walter Bagehot wrote: 


You may observe that when an ancient liberal, Lord John Russell, or any of the 
essential sect, has done anything very queer, the last thing you would imagine anybody 
would dream of doing, and is attacked for it, he always answers boldly, ‘Lord Lansdowne 
said I might’; or if it is a ponderous day, the eloquence runs, ‘A noble friend with whom I 
have had the inestimable advantage of being associated from the commencement (the 
infantile period I might say) of my political life, and to whose advice,’ etc., etc., etc.—and a 
very cheerful existence it must be for ‘my noble friend’ to be expected to justify—(for they 
never say it except they have done something very odd)—and dignify every aberration. 
Still it must be a beautiful feeling to have a man like Lord John, to have a stiff, small man


* Malagrida was an Italian Jesuit settled in Portugal who was burned in 1761. The supposed 
jesuitical propensities of Shelburne led to the name becoming his popular title. Hence Goldsmith’s 
unintended mot: ‘Do you know that I never could conceive the reason why they call you Malagrida, 
for Malagrida was a very good sort of man.’ 
















bowing down before you. And a good judge (Sydney Smith) certainly suggested the con- 
ferring of this authority. ‘Why do they not talk over the virtues and excellencies of 
Lansdowne? There is no man who performs the duties of life better, or fills a high station 
in a more becoming manner. He is full of knowledge, and eager for its acquisition. His 
remarkable politeness is the result of good nature, regulated by good sense. He looks for 
talents and qualities among all ranks of men, and adds them to his stock of society, as a 
botanist does his plants; and while other aristocrats are yawning among stars and garters, 
Lansdowne is refreshing his soul with the fancy and genius which he has found in odd places, 
and gathered to the marbles and pictures of his palace. Then he is an honest politician, a 
wise statesman, and has a philosophic mind’, etc., etc. Here is devotion for a carping critic; 
and who ever heard before of bonhomie in an idol? (Bagehot, Works, 2, 64-5). 


Of the father, Atticus (an alias of ‘Junius’) wrote: 


The Earl of Shelburne had initiated himself in business by carrying messages between 
the Earl of Bute and Mr. Fox, and was for some time a favourite with both. Before he was 
an ensign he thought himself fit to be a general, and to be a leading minister before he ever 
saw a public office. The life of this young man is a satire on mankind. The treachery which
deserts a friend, might be a virtue compared to the fawning baseness which attaches itself 
to a declared enemy (Letters of Junius, Wade’s edition, 2, 248). 


Naturally justice was no more to be expected in eighteenth-century news- 
paper diatribes than in the twentieth century, but a clever caricaturist does not 
represent Charles Fox as a living skeleton. Those who attacked the son—there 
were such people—took a different line, as Bagehot hints. Perhaps even in his 
very different character something of the ancestral Petty survives. We shall try 
to discover what this was.

Forty years ago Hull brought out an edition of Petty’s tracts in which he 
included Graunt’s work. In 1927 the fifth Marquis of Lansdowne printed a 
selection from the Petty papers and in 1928 the correspondence between Petty 
and his wife’s cousin,* Sir Robert Southwell (The Petty-Southwell Correspondence, 
edited by the Marquis of Lansdowne, London 1928). 

We shall have to examine in detail both the ‘works’ and the ‘papers’, but, 
as a light upon the character of Petty, the Southwell correspondence is the 
strongest we have. Southwell himself was some generations farther away from 
adventuring than Petty. He came of an ‘undertaker’ stock—the adventurers in 
Ireland of Queen Elizabeth’s time—and his father was vice-admiral of Munster 
before him. He was born in 1635 (died in 1702), regularly educated (Queen’s 
College, Oxford and Lincoln’s Inn), knighted in 1665, for some time Clerk of the 
Privy Council, in the diplomatic service, held other offices, was a member of 
parliament and eventually settled in a country house near Bath. He was 
President of the Royal Society 1690-5. He might be described as a lesser 
William Temple; better educated and less selfish, not so able, but with the same 
cool, cautious judgment; a psychological antithesis of his correspondent. 


* Petty married in 1667 Lady Fenton, widow of Sir Maurice Fenton and daughter of Sir 
Hardress Waller who, knighted in 1629, fought for the Parliament and was one of the King’s 
judges; he was a major general in Ireland in 1650-1 and a patron of Petty there.







The correspondence covers the eleven years 1676-87. Both men were, even 
by modern standards, middle aged. They write one to another with complete 
frankness; there is a remarkable absence of the elaborate verbal formalities 
which in seventeenth-century and even eighteenth-century letters are so 
wearisome. 

Petty’s side of the correspondence consists roughly of domesticities 16 parts, 
eager accounts of his quarrels and law suits concerning money 40 parts, discussion
of papers or projected papers 40 parts, add autobiographical boasting to make 
up the 100. 

In the purely domestic part of the correspondence, Petty is seen as a kind, 
good-natured father interested in the doings of his relations by marriage, also 
as a very bad judge of others’ feelings. I remember to have read an unpublished 
letter by the famous Edwin Chadwick, the great and very unpopular sanitarian 
of a century ago. It was written to a friend whose wife had just died of puerperal 
fever. Chadwick expressed regret in the shortest possible formula and assured 
his correspondent that the best solace he could have would be to assist in 
pushing forward a bill (which I think he enclosed) to promote some sanitary 
reform which would have the effect of making it less likely that other men would 
lose their wives in childbed. I remember thinking that, however sensible the 
recommendation, the man who gave it was not likely to bring much comfort to 
his friend. 

Petty was very much like Chadwick here. Southwell lost his wife in 1681 
and Petty condoled with him as follows: 


When your good father dyed, I told you that hee was full of years and ripe fruit, and 
that you had no reason to wish him longer in the paines of this world. But I cannot use 
the same Argument in this Case for your Lady is taken away somewhat within half the 
ordinary age of Man and soon after you have been perfectly married to her; for I cannot 
believe your perfect union and assimulacon was made till many years after the Ceremonies 
at Kinsington. 

What I have hitherto said tends to aggravate rather than mitigate your sorrow. But 
as the sun shining strongly upon burning Coles doth quench them, so perhaps the sadder 
Sentiments that I beget in you may extinguish those which now afflict you. The next Thing 
I shall say is, That when I myself married, I was scarce a year younger then you are now,
and consequently do apprehend That you have a second Crop of Contentment and as much 
yet to come as ever I have had. 


This remark, curiously enough, was not well received. 


You doe not onely condole the great loss I have sustained in a wife, but you seeme to 
think it reparable....But when by 19 yeares conversation I knew the greate vertues of her 
mind, and discover since her death a more secrett correspondence with Heaven in Acts of 
Pietye and devotion (which before I knew not of), you will allow me, at least for my 
Children’s sake, to lament that they have too early lost their guide. 


Petty could not, it seems, understand that Southwell was wounded and 
returned to the charge in a letter which is lost. That letter provoked a reply 










which even Petty could not misunderstand and elicited an apology (Correspon- 
dence, p. 90). 

Petty was quite incorrigible. A few years later Southwell had another family 
bereavement and is condoled with in the following terms: 


That by the death of your Father, Mother and Sister, of Sir Edward Deering and your 
three nephews, you are the Head and Governor of both Familyes. That by the death of 
Rupe, Ingenious Neddy culminates; and by that of your Excellent Lady you are entitled to 
that million I mentioned of unmarryed teeming Ladyes. 


Once again, Southwell was not comforted. ‘Cousin, you doe wipe off Teares 
at a very strange rate, but why did nature furnish Them if there must be no 
Sorrow?’ 

Petty had a very quick perception of when and where his shoe pinched, but 
no imaginative sympathy. 

Passing to Petty’s financial affairs and lawsuits, the position was this. By 
original grants, by purchase and in various ways, Petty had widely scattered 
Irish interests. Questions of the validity of the original grants, of rent charges 
due to the crown or to other grantees, of matters of fact and matters of law were 
endless. Petty saw himself steadily as a great public benefactor harassed by 
scoundrels, and it never occurred to him even as a theoretical possibility that 
others had rights. Of his manner of proceeding the editor of the correspondence 
gives a typical example (Correspondence, p. 90). In 1681 Petty gave evidence 
before Lord Chief Baron Hen as to ‘Soldier’s land’ which he had bought in 
Kerry and, it seems, the court decided against him. 


Petty gave vent to his chagrin in a long and scurrilous lampoon against the offending 
judge, entitled: ‘HENEALOGIE or the legend of Hen-Hene and Pen-Hene’, in two parts. 
Whereof the first doth in 24 chapters of Raillery, contain the enchantements, metamor- 
phoses and merry conceits relating to them. The second part contayning (in good earnest) 
the foolish, erroneous, absurd, malicious and ridiculous ‘JUDGEMENTS of HEN-HENE’. 
Fortunately perhaps for the repute of its author, this diatribe was never made public. 


Fortunately, also, for a more material reason; it would probably have led 
to a second incarceration for contempt of court. 

Southwell evidently viewed his good cousin’s proceedings with a mixture of 
gentlemanlike annoyance and practical minded contempt. He expressed these 
feelings more than once; the following extract from a letter of 1677 is typical; 
the particular suit in progress to which reference is made was a claim for £5000
in respect of a sum of £2500 actually advanced by Petty to the Farmers of
Revenue.


And suffer from me this expostulation, who wish your prosperity as much as any man 
living; and having opportunities to see and heare what the temper of the world is towards 
you, I cannot but wish you well in Port, or rather upon the firm Land, and to have very 
little or nothing at all left to the mercy and good will of others. For there is generally 
imbibed such an opinion and dread of your superiority and reach over other men in the 
wayes of dealing, that they hate what they feare, and find wayes to make him feare that is 







feard. I doe the more freely open my soul to you in this matter, because tis not for the 
vitells that you contend, but for outward Limbs and accessions, without which you can 
subsist with Plenty and Honour. And therefore to throw what you have quite away, or at 
least to put it in dayly hazard onely to make it a little more than it is, Is what you would 
condemne a thousand times over in another. And you would not think the Reply sufficient 
that there was plain Right in the Cause and Justice of their side, for iniquities will abound 
and the world will never be reformed. 

After all this is said, I mean not that you should relinquish the pursute of your 2500£, 
which is money out of your Pockett and for which you are a Debtor unto your Family. 
But for other pretensions, lett them goe for Heaven’s Sake, as you would a hott coale out 
of your hand: and strive to retire to your home in this Place, where you had the respect of 
all, and as much quiet as could be in this life, before your medling with that pernicious 
business of the Farme. 


There is no reason to suppose that Petty ever took such sensible advice. Yet, 
somehow, he kept his head well above water. 

In the later part of the correspondence Petty indulges in that complacent 
financial retrospect which he inserted in his Will and I have, perhaps too harshly, 
described as autobiographical boasting. It is possible that Southwell had heard 
of these financial triumphs rather often; at least there is a hint of this in the 
following:

I will onely note that since you are soe Indulgent as to think me worthy of being your 
Depositary in this great Audit, and expect by the Course of Nature that I should speake
when you are Silent, you must allow me liberty without blame to aske questions when you
seeme defitient or Redundant. 


That you are defitient may be suggested when, on the fortunate syde, I find noe Item 
for my Lady or of the hopefull stock she has brought you (p. 227). 

The shrewd thrust of the last sentence was deadly. The subject does not 
recur. 

I have indicated the character of the non-scientific part of the correspondence 
because we must examine Petty’s scientific writings in greater detail. I think, 
however, we have enough to justify a provisional diagnosis of Petty’s psycho- 
logical type. 

In literature and in life the perennial boy is often encountered. But while 
Peter Pan and Mr Reginald Fortune make far more friends than foes, that is not 
so true of their living counterparts. The exuberant flow of ideas and schemes, the 
intense and restless interest in everything which is characteristic of the clever 
child, often is extraordinarily attractive when it is associated with and con- 
trolled by the trained intelligence of a man. But the bad as well as the good 
points of a childlike or adolescent soul* are to be brought into the account. The 


* The first Marquis of Halifax said of King Charles that ‘his inclinations to love were the effects 
of health and a good constitution, with as little mixture of the seraphic part as ever man had’, and 
Petty held that the King was typical. In The Petty Papers (no. 93 of vol. 2) there is a memorandum 
headed ‘Californian Marriages with the Reasons thereof’. ‘In California’, says Petty, ‘6 men were 
conjugerted to 6 women in order to beget many and well conditioned children, and for the greatest 
venereall pleasure, in manner following, viz.’ 

He then sets out the plan. One man ‘excelling in strength, nimbleness, beauty, wit, courage 










clever child is often naively and intensely selfish, and so remains as the eternal 
boy; his quite crude and unashamed egoism, his inability to understand that 
others have feelings and even rights, repel as strongly as his intellectual freshness 
attracts. How far he is a success in life depends on which way the balance 
turns. 

Petty seems to me a good example of this psychological type; its good points, 
the restless energy and exuberant flow of ideas, were sources of strength in such 
a time as that of the Civil War and Restoration, which, particularly the Restora- 
tion period, was in virtues and vices an age of grown-up children. Indeed his 
emotional adolescence may have shielded him from the deadly enmity of real 
men. Its bad points made him enemies, but they were children like himself. 
Nearly a century later, in a time of adults, these same characteristics, restless 
intellectual energy and vanity, exhibited by one no longer a rollicking adventurer 
but a great landowner, produced an unfavourable balance and we have ‘Mala- 
grida’. In Malagrida’s son, one has a change; the attractive traits, the eager 
interest in all sorts of things is still there, but the childish hungry vanity has 
been softened or sublimed. The cynic may say that it was easy for a great Whig 
lord 150 years ago to be agreeable, to keep himself hors concours; perhaps it was,
although the Dropmore Papers raise doubts. The fact, however, is certain. In the 
third Lord Lansdowne one sees the good and in the first the bad effects of the 
perennial boyishness of the ancestor. The ancestor lived in a state of society 
where the good points outweighed the bad points. That is why, although he
made enemies and was often vexed, he was able to view his career with com- 
placency and to bequeath a great fortune. But it is not Petty as a man but 
Petty as a scientific worker who is the proper object of my study. 

How far does the psychological make-up which, as I think, characterized 
Petty conduce to scientific investigation? We might expect that it would be an 
immense stimulus to pioneering, that such a man would direct attention to a 
number of problems which deserved study, but that it would not lead to the 
production of any solid contribution to knowledge. Our task is to examine in 
some detail Petty’s scientific work. 


and good sense’ subsequently called the Hero, is allowed four women for his sole use. One Great 
Rich Woman is allowed five men who are to serve her when she pleases, but another woman is 
allotted to the five men for use in common by the five.

It may be said this fable is only an after dinner jest—perhaps that is the whole explanation. 
But Petty does go to the trouble of financial calculations, and does seem to suggest a serious con- 
sideration. (‘The encrease of children will be great and good.’ ‘No controversy about joynture, 
dower, maintenance, portion etc.’) Nobody emotionally adult would be likely to make Californian 
Marriages a basis for practical statecraft. 




II. PETTY’S SCIENTIFIC WORK 

It is no part of my undertaking to survey the whole of Petty’s scientific 
activities, but to speak only of his medical and vital statistical work. 

In Hull’s edition of Petty’s writings, the editor discusses Petty’s status as an 
economist and remarks that Petty’s view that value depended upon labour was 
probably derived from Hobbes. The corn rent of agricultural lands was in Petty’s
view determined by the excess of their produce over the expenses of cultivation, 
paid in corn, and the money value of the excess will be measured by the amount 
of silver which a miner, working for the same time as the cultivator of the corn 
land, will have left after meeting his expenses with a part of the silver he secures 
(Hull, p. xxiii). Why there should be any surplus, he explains by density of 
population. 

Prof. Hull refrained from attempting to assess Petty’s work in terms of 
modern economic theory. A mere medical statistician will naturally follow this 
example. More than a century ago, Mr Chainmail had learned from Mr MacQuedy 
that the essence of a safe and economical currency was an interminable series of 
broken promises and added: ‘There seems to be a difference among the learned 
as to the way in which the promises ought to be broken; but I am not deep 
enough in their casuistry to enter into such nice distinctions.’ Medical statisti-
cians may well adopt Mr Chainmail’s modest attitude towards the whole field
of economic theory. Confining ourselves to statistics, we must consider what 
Petty thought should be done and what he actually did himself. 

Under the first heading, praise can be unstinted. More than 150 years before 
the establishment of the General Register Office, Petty specifically proposed the 
organization of a central statistical department the scope of which was wider 
than that of our existing General Register Office. It was to deal not only with 
births, marriages, burials, houses, the ages, sexes and occupations of the people, 
but with statistics of revenue, education and trade (see The Petty Papers, 1,
171-2). He did not confine himself to vague recommendations, but drew up an 
enumeration schedule to be used for each parish. On this was to be entered: 
The number of housekeepers and of houses; the number of hearths; the number 
of statute acres; the number of people by sex and in age groups, viz. under 10, 
between 10 and 70, over 70; for males those aged 16 to 60, and for females those 
between 16 and 48 and how many of these latter were married; how many 
persons were incurable impotents and how many lived upon alms. This, it will 
be noted, is a better enumeration schedule than any used in England before the 
census of 1821. Further in his notes (printed in The Petty Papers) are various 
suggestions for the utilization of data collected in this way. 

The most striking is this: ‘The numbers of people that are of every yeare old 
from one to 100, and the number of them that dye at every such yeare’s age, 
do shew to how many yeare’s value the life of any person of any age is equivalent 













and consequently makes a Par between the value of Estates for life and for 
years’ (The Petty Papers, 1, 193). 

This is, I think, the most remarkable thing Petty ever wrote, for it suggests 
that he had grasped the principle of an accurate life table, viz. a survivorship 
table based upon a knowledge of rates of mortality in age groups. No such table
was constructed from population data until the end of the eighteenth century, 
because until then data of the age distribution of the living population were not 
obtained. Whether Petty also realized that under certain conditions a life table 
could be constructed without knowledge of the ages of the living population is 
a controversial matter which I shall discuss later on. 

Then he makes suggestions which are relevant enough to modern demo- 
graphic problems. 


By the proportion between marriages and births, and of mothers to births, may be 
learnt what hindrance abortions and long suckling of children is to the speedier propagation 
of mankind; as also the difference of soyles and ayres to this foecundity of women. 

By the proportion between maryd and unmaryd teeming women, may be found in what 
number of yeeres the present stock of people may bee encreased to any number assigned 
answerable to the defect of the peopling of the nation for strength or trade. 


There are not wanting some suggestions which imply that even if Petty’s 
opinion of the Faculty were higher than that of Sydenham (whom we honoured 
posthumously) it was tinged with scepticism. 


Whether they [viz. fellows and licentiates of the College of Physicians] take as much 
medicine and remedies as the like number of any other society. 
Whether of 1000 patients to the best physicians, aged of any decade, there do not die 
as many as out of the inhabitants of places where there dwell no physicians. 
Whether of 100 sick of acute diseases who use physicians, as many die and in misery, 
as where no art is used, or only chance. (The Petty Papers, 2, 169-70.) 


This statistical experiment has not yet been performed and indeed might be 
hardly so conclusive as Petty implied. 

When one passes from what Petty suggested to what he actually did himself, 
our praise must be qualified. As Prof. Hull said, he was ‘more than once misled 
into fancying that his conclusions were accurate because their form was 
definite’. 

In judging Petty it is but fair to contrast him with College contemporaries 
whose names are more honoured by us. Among his contemporaries in the 
College were Thomas Browne and Thomas Sydenham. Browne was a much 
older man than Petty, Sydenham almost his coeval. Of Browne’s quality as a 
physician we know nothing; but his literary influence indirectly—through 
Samuel Johnson—and directly upon generations of readers has been greater than 
that of any other practising medical man. Browne, like Petty, had an enormous 
range of interests and his book learning was greater. But, as we shall see, when 




he tackles a problem of demography, Petty’s rashest guesses seem by com- 
parison as soberly scientific as an annual report of the Registrar-General. 

Sydenham was an iconoclast in clinical practice and believed himself to be 
emancipated from the rule of ancient authority. No fantastic arithmetical 
calculations are to be found in his writings. In fact, with a single exception
(Observationes Medicae, 2, i), no arithmetic at all. It never seems to have entered
his mind, although his greatest work purports to give the history of the diseases 
in London through a generation, that the arithmetical statements of the London 
Bills of Mortality were of any value whatever. 

Sydenham was too wise a man for us to think that he rejected the evidence 
because the data were compiled by illiterate old women. He would have known 
that the sworn searchers had the loquacity of their sex and rank and were likely 
to ask what ‘the doctor said’. He rejected it, because counting and measuring 
things did not come within his purview, just as the first beginnings of pathology 
and medical chemistry seemed to him irrelevant.

For the most part, Petty’s statistical work was severely practical, but there 
is one excursion into theory which is interesting. It is to be found in a section of 
his tract on the use of what he calls Duplicate Proportion and is reprinted by 
Hull (pp. 622-3). 

Petty states that there are more persons living between the ages of 16 and 
26 than in any other decade of life. The statement is not true for modern 
populations and was probably not true for the English population of Petty’s 
time. In 1861-71 (before the fall in the birth rate and infant mortality rate) 
there were 5.4 millions living under 10, and 4.0 between 15 and 25. But perhaps
Petty meant that there were more living in the decade 16 to 26 than in any later 
decade, in which case his statement was of course right unless the birth rate was 
falling. 

He then asserts that the 

Roots of every number of Men’s Ages under 16 (whose Root is 4) compared with the 
said number 4, doth show the proportion of the likelyhood of such men reaching 70 years 
of Age. As for example: ‘Tis 4 times more likely that one of 16 years old should live to 70, 
than a new born Babe. ‘Tis three times more likely, that one of 9 years old should attain 
the age of 70, than the said infant. Moreover, ’tis twice as likely, that one of 16 should reach 
that Age, as that one of four years old should do it; and one third more likely, than for 
one of nine. 

We have no life table for England in 1674. Perhaps the nearest modern 
experience might be the Liverpool Table calculated by Farr seventy years ago. 
According to that table the chance of a new-born child living to be 65 was 
0.0976 and the chance of a person of 15 living to 65 was 0.202, which is about
double the infant’s chance, not four times as large. For the Healthy Districts,
the chances are 0.4246 and 0.54585; that is, in a ratio of 1.28 to 1.

Petty’s statements are wildly wrong. The interesting point is how did he 
reach them? The only figures he had were printed by Graunt. 











This ‘Life Table’ gives l_x as follows:

      x      0    6   16   26   36   46   56   66   76
      l_x  100   64   40   25   16   10    6    3    1


Now if we take 2 as the survivors to 70 (it does not of course matter what the 
numerator is for comparative purposes), then the infant’s chance of surviving 
to 70 is 0.02 and the person of 16 has the chance 1/20=0.05, a ratio of 2.5, not
wildly different from the Liverpool Table figure and very different from 4.0.
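The comparison above is simple enough to check mechanically. The following is a minimal sketch in Python, using only the figures quoted in the text (Graunt's survivors at birth and at 16, the 2 survivors to 70 taken for the argument, and Farr's probabilities); the variable names are illustrative, and reading Petty's "4 times" as the square root of 16 set against unity for the new-born is an interpretation of his rule, not something he states in these terms.

```python
from math import sqrt

# Graunt's survivors out of 100 births (figures quoted in the text).
graunt_lx = {0: 100, 16: 40}
survivors_to_70 = 2  # the value taken in the text for the argument

def chance_of_reaching_70(age):
    """Chance that a person alive at `age` survives to 70 under Graunt's table."""
    return survivors_to_70 / graunt_lx[age]

# Ratio of the chance at 16 to the chance at birth.
graunt_ratio = chance_of_reaching_70(16) / chance_of_reaching_70(0)

# Petty's rule: likelihood in proportion to the root of the age (root of 16 is 4).
petty_ratio = sqrt(16)

# Farr's tables as quoted: chance of living to 65 from birth and from 15.
liverpool_ratio = 0.202 / 0.0976
healthy_districts_ratio = 0.54585 / 0.4246

print(f"Graunt's table:    {graunt_ratio:.2f}")
print(f"Petty's rule:      {petty_ratio:.2f}")
print(f"Liverpool Table:   {liverpool_ratio:.2f}")
print(f"Healthy Districts: {healthy_districts_ratio:.2f}")
```

On these figures Graunt's table and the Liverpool Table both give a ratio between 2 and 2.5, while Petty's rule gives 4.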

A fortiori when Petty, having passed above age 16, asserts that ‘it is five to
four, that one of 26 years old will die before one of 16; and 6 to 5 that one of 36 
will die before one of 26’, we are in a region of pure fantasy because, even if he 
had had the statistical data, Petty would not have had the technical knowledge 
to solve the problem involved, viz. to find the probability that of two lives aged 
respectively x and y, the former will fall before the latter. 

If we keep within the range of the simple arithmetic which Petty used, the 
result cannot be obtained. 
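For comparison, a modern reader can attempt a rough version of the two-lives problem with Graunt's figures. The sketch below is an illustration only and is not a method available to Petty: it assumes the two lives are independent, that deaths are grouped by ten-year interval as in Graunt's table, that when both die in the same interval either order is equally likely, and that the table closes with no survivors at 86 (an assumption of this sketch). The function names are hypothetical.

```python
from fractions import Fraction

# Graunt's survivors out of 100 births from age 16 onward.
# The closing value of 0 at age 86 is an assumption of this sketch.
AGES = [16, 26, 36, 46, 56, 66, 76, 86]
LX = [40, 25, 16, 10, 6, 3, 1, 0]

def decade_death_probs(start_age):
    """Probability of dying in each successive decade, for a life now aged start_age."""
    i = AGES.index(start_age)
    alive_now = LX[i]
    return [Fraction(LX[j] - LX[j + 1], alive_now) for j in range(i, len(LX) - 1)]

def prob_first_dies_before_second(x, y):
    """Approximate chance that the life aged x dies before the life aged y."""
    dx = decade_death_probs(x)
    dy = decade_death_probs(y)
    total = Fraction(0)
    for k, p_x in enumerate(dx):
        # Second life dies in the same decade (tie split evenly) or in a later one.
        p_y_same = dy[k] if k < len(dy) else Fraction(0)
        p_y_later = sum(dy[k + 1:], Fraction(0))
        total += p_x * (p_y_later + p_y_same / 2)
    return total

p_26_before_16 = prob_first_dies_before_second(26, 16)
p_36_before_26 = prob_first_dies_before_second(36, 26)

# Petty's stated odds, converted to probabilities.
petty_26_before_16 = Fraction(5, 9)   # "five to four"
petty_36_before_26 = Fraction(6, 11)  # "6 to 5"

print(float(p_26_before_16), float(petty_26_before_16))
print(float(p_36_before_26), float(petty_36_before_26))
```

Under these assumptions the chances come out close to even (about 0.50 and 0.53), short of the odds Petty asserted; the grouping by decades is coarse, so the figures indicate an order of magnitude and no more.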

He then passes to this statement: 

To prove all which I can produce the accompts of every Man, Woman, and Child, 
within a certain Parish of above 330 Souls; all which particular Ages being cast up, and 
added together, and the Sum divided by the whole number of Souls, made the Quotient 
between 15 and 16; which I call (if it be Constant or Uniform) the Age of that Parish, or 
Numerus Index of Longaevity there. Many of which Indexes for several times and places, 
would make a useful Scale of Salubrity for those places, and a better Judg of Ayers than
the conjectural Notions we commonly read and talk of. And such a Scale the King might
as easily make for all his Dominions, as I did for this one Parish. 


The puzzle is to discover why Petty thought this statistical experiment 
proved his point and why he regarded the mean age of the population of a parish 
its index of longevity. The first question I cannot answer at all; about the second 
I can make a guess. If the parish population were supported solely by births and
there was no migration, then, if the death rates at ages did not vary, the popula- 
tion would be a stationary population and both the mean age of the living and 
the mean age at death would be constant. The expectation of life is greater than 
the mean age of the living unless the rates of mortality at early ages are very 
high and the more favourable the rates of mortality the greater will be the 
difference. In Petty’s day, when mortality at early ages was very high, the two 
constants were probably not far apart, but it is certain that both expectation of 
life and mean age of a life table population were greater than 16; probably of 
order 28 to 32. 

I think we may be sure that the parish Petty counted was not stationary in
the statistical sense, but had an excess of births over deaths, and that his average 
threw no light upon the rates of mortality. 




Passing to practical statistics, it will be convenient first to note rapidly 
statistical observations which are incidental in treatises of primarily financial or 
economic interest. In the Verbum sapienti, which although not printed until 
1691 was written as early as 1665, Petty attempts to reckon what a man is 
worth. Here is the method. He concludes from financial data that the annual 
proceed of the Stock or Wealth of the nation yields 15 millions, but that the 
expenses of the nation are 40 millions. So the balance of 25 millions must be 
derived from the labour of the people. He assumes that the population is 6 
millions and that half of these can work, and earn £8. 6s. 8d. a head per 
annum. This would be 7d. a day, abating 52 Sundays and half as many other days 
for sickness, holidays, etc. ‘Whereas the Stock of Kingdom, yielding but 15 
Millions of proceed, is worth 250 Millions; then the People who yield 25, are 
worth 416 2/3 Millions. For although the Individuums of Mankind be reckoned 
at about 8 years purchase; the Species of them is worth as many as Land, being 
in its nature as perpetual, for ought we know.’ 
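
The arithmetic of this valuation is easily checked. The following Python sketch retraces it with the figures quoted above; the variable names are my own, and the reading of 'half as many other days' as 26 days is an assumption.

```python
from fractions import Fraction

# Figures quoted from the Verbum sapienti (pounds sterling per annum).
expenses = 40_000_000
proceed_of_stock = 15_000_000
population = 6_000_000

# The balance must come from labour; half the people are assumed to work.
labour_income = expenses - proceed_of_stock              # 25 millions
workers = population // 2                                # 3 millions
earnings_per_worker = Fraction(labour_income, workers)   # pounds a year

# Convert to pence (240 to the pound) and spread over the working days,
# abating 52 Sundays and, by assumption, 26 other days.
working_days = 365 - 52 - 26
pence_per_day = earnings_per_worker * 240 / working_days

# Capitalise the people at the same number of years' purchase as the stock:
# 250 millions of stock yield 15, so a yield of 25 is worth 250 * 25 / 15.
value_of_stock = 250_000_000
value_of_people = Fraction(value_of_stock * labour_income, proceed_of_stock)

print(float(earnings_per_worker))   # about 8.33 pounds, i.e. 8l. 6s. 8d.
print(float(pence_per_day))         # just under 7d. a day
print(float(value_of_people))       # about 416.67 millions
```

The stock is here valued at 16 2/3 years' purchase, and the people are capitalised at the same rate.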

Why an individual’s working life is worth only 8 years’ purchase is not clear. 
One would be inclined to put it as the average number of years lived in the 
working period of life. Perhaps Petty took Graunt’s table and worked out the 
average number of years of life lived between the ages of 16 and 56; it is
nearly 8.

He then calculates the money loss due to 100,000 dying of the plague and 
makes it nearly 7 millions, adding that £70,000 would have been well disposed 
in preventing this ‘centuple loss’. Perhaps this is the first printed statement 
of the neglected truth that public health measures pay. 
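
The 'centuple loss' can be retraced in a few lines. The sketch below assumes that Petty valued each head by dividing the 416 2/3 millions by the 6 millions of people; that step is not spelled out in the passage above, and the names are illustrative.

```python
from fractions import Fraction

value_of_people = Fraction(1250, 3) * 1_000_000   # 416 2/3 millions of pounds
population = 6_000_000

# Assumed step: an average value per head of the whole population.
value_per_head = value_of_people / population     # about 69 pounds

plague_deaths = 100_000
loss = value_per_head * plague_deaths             # nearly 7 millions

# Petty's suggested outlay on prevention and the ratio of loss to outlay.
outlay = 70_000
ratio = loss / outlay

print(round(float(value_per_head), 2))
print(round(float(loss)))
print(round(float(ratio), 1))
```

On these assumptions the loss is very nearly a hundred times the proposed outlay.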

Since Petty’s day, others, including Farr himself, have done sums of this 
kind; it is a popular occupation in the United States of America. 

Farr went to work more elaborately, making out a balance sheet of a man 
from the cradle to the grave. But the principle was much the same. We cannot 
say it is a wholly useless pastime. There is of course the difficulty that if more 
lives are saved the price of labour might fall. But to Petty that would have been 
no difficulty, because he held that wealth is purely relative, viz. that if the income 
of each person in a community is halved, everybody is as well off as before. 

In the Political Anatomy of Ireland, Petty seeks to determine war losses in 
Ireland. 

The number of the People being now Anno 1672 about 1,100,000 and Anno 1652 about 
850M. Because I conceive that 80 M. of them have in 20 years encreased by generation 
70 M. by return of banished and expelled English; as also by the access of new ones,
80 M. of New Scots, and 20 M. of returned Irish, being all 250 M. 

Now if it could be known what number of people were in Ireland Ann. 1641, then the 
difference between the said number, and 850, adding unto it the increase by generation in
11 years will shew the destruction of people made by the Wars, viz. by the Sword, Plague
and Famine occasioned thereby.


I find by comparing superfluous and spare Oxen, Sheep, Butter and Beef that there was 
exported above 1/3 more Ann. 1664 than in 1641, which shews there were 1/3 more of
people, viz. 1,466,000. Out of which Sum take what were left Ann. 1652, there will remain
616,000 destroyed by the Rebellion. 

Whereas the present proportion of the British is as 3 to 11; But before the Wars the 
proportion was less, viz. as 2 to 11 and then it follows that the number of British slain in 
11 years was 112 thousand Souls; of which I guess 2/3 to have perished by War, Plague and 
Famine. So as it follows that 37,000 were massacred in the first year of Tumults: So as 
those who think 154,000 were so destroyed, ought to review the grounds of their Opinions. 

It follows also, that about 504 M. of the Irish perished, and were wasted by the Sword, 
Plague and Famine, Hardship and Banishment, between the 23 of October 1641 and the 
same day 1652. Wherefore those who say, That not 1/8 of them remained at the end of the
Wars, must also review their opinions; there being by this Computation near 2/3 of them;
which Opinion I also submit. 


Assuming, which is rash, that the estimates of population in 1672 and 1652 
are correct, the assumption that population varied inversely as exportation of 
cattle seems bold. Might it not be that shipping facilities were better in 1664 
than in 1641? Had there been no exportation we could not infer the population 
to be infinite. 

Again Petty has multiplied the estimate for 1672 by 1.333. But he needed
the population of 1664, which presumably was smaller than that of 1672. If his
estimate is right, the population was increasing at the rate of about 12.5
thousands per annum, so he should have multiplied 1,000,000 not 1,100,000 by
1.333 and has overestimated the 1641 population by 133,330, and therefore the
number destroyed by the same amount, an overstatement of 20%. But this is 
not all. If we assign the decrement of population between 1652 and 1641 wholly 
to sword, plague and famine, we must assume that births continued at the peace- 
time rate; not a likely assumption. Lastly, it seems unreasonable to assign the 
casualties to the two races in precise proportion to their estimated numerical 
strength in the population of 1641. 
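
The correction just described may be set out as a short calculation. This is a sketch in Python using only the figures quoted above; linear growth between 1652 and 1672 is the assumption implied by the rate of 12.5 thousands per annum, and the variable names are mine.

```python
# Petty's estimates of the population of Ireland.
pop_1652 = 850_000
pop_1672 = 1_100_000

# Exports were said to be one third greater in 1664 than in 1641, from which
# Petty inferred a population one third greater in 1641.
factor = 4 / 3

# Petty applied the factor to the 1672 population.
petty_1641 = pop_1672 * factor
petty_destroyed = petty_1641 - pop_1652

# Assuming linear growth, the population of 1664 is the proper base.
annual_increase = (pop_1672 - pop_1652) / 20             # 12,500 a year
pop_1664 = pop_1652 + annual_increase * (1664 - 1652)    # 1,000,000
corrected_1641 = pop_1664 * factor
corrected_destroyed = corrected_1641 - pop_1652

overestimate = petty_1641 - corrected_1641

print(round(petty_1641), round(petty_destroyed))
print(round(corrected_1641), round(corrected_destroyed))
print(round(overestimate))
```

Petty's printed 1,466,000 and 616,000 are these first two values with the odd hundreds dropped.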

How it follows that 37,000 were massacred in the first year of tumults I do 
not know. 

In a later work (Treatise of Ireland, pp. 610-11) Petty has another shot at
this problem. 

He now assumes that Graunt’s deduction from a Hampshire parish register, 
viz. that christenings are to burials in the ratio of 5 to 4, applies to Ireland, and 
that the death rate is 1 in 30, i.e. about what Graunt estimated for London and 
much higher than his estimate for the country. He then proceeds in this way. 
He estimates the population of 1653 to be 900,000 and that of 1687, 1,300,000. 
Then taking 1/30 for the death rate and 1/24 for birth rate, he makes the 
population of 1652, 985,000. He does not comment on the great decrease between
1652 and 1653; but there was still war in Ireland in 1652.
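
The birth rate of 1/24 follows from the two assumptions just stated. A brief check in Python, with names of my own choosing, is given below; the 'natural increase' line ignores migration and war and is offered only as an illustration.

```python
from fractions import Fraction

death_rate = Fraction(1, 30)                 # deaths per head per annum
christenings_to_burials = Fraction(5, 4)     # Graunt's Hampshire ratio

# Births exceed deaths in the ratio 5 to 4.
birth_rate = christenings_to_burials * death_rate

# Natural increase implied by the two rates, ignoring migration and war.
natural_increase = birth_rate - death_rate

print(birth_rate)         # 1/24
print(natural_increase)   # 1/120
```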

He now says that the population of 1641 was greater than that of 1687, ‘as
appears by the Exportations, Importations, Tyths, Grist-Mills and the Judgment
of Intelligent Persons’. This time he takes the population to be 1,400,000—
a little less than in the earlier estimate—and by the same kind of reasoning
again makes the war losses to be about 600,000. One is reminded of Hull’s
remark that Petty confused the accurate with the definite. Also one notes the 
inevitable tendency of a polemical writer—which Petty very decidedly was— 
to maintain his original assertion. Those of us who have never yielded to this 
temptation may cast stones at him. It is not I believe too cynical to say that 
any calculation Petty made would have made the war losses around 600,000. 

Returning to the Political Anatomy of Ireland, we find here a distinct claim 
that the mean age at death (not the mean age of the living) measures longevity. 

As to Longaevity, inquiry must be made into some good old Register of (suppose) 
20 persons, who were all born and buried in the same Parish, and having cast up the time 
which they all lived as one man, the Total divided by 20 is the life of each one with another; 
which compared with the like Observation in several other places, will show the difference 
of Longaevity, due allowance being made for extraordinary contingencies and Epidemical 
Diseases happening respectively within the period of each Observation (p. 172). 

Apart from what we should think the absurdity of basing important con- 
clusions upon an average of 20—and Petty only gives 20 as a figure—the mean 
ages at death of different populations are not comparable unless in each place 
the population is stationary in the sense described above. But, since so acute a 
man as Edwin Chadwick made the same mistake in the nineteenth century as 
Petty in the seventeenth century and it continues to be made in various places 
in the twentieth century, we need not be superior. 

We now come to Petty’s purely statistical work which is concerned with the 
growth of population; before examining this in detail, it will be convenient to 
consider the methods available in the seventeenth century for estimating 
population and notions then current on what may be called the theory of 
population growth. 

It is hard to believe that in the ancient world nobody studied demography
arithmetically. There is evidence that the Romans enumerated citizens—the
word census is pure Latin—and it has been suggested that the Romans made
life tables. Gouraud, cited by Todhunter (History of the Mathematical Theory of 
Probability, p. 14), refers to a passage cited from Ulpian in the Digest which I 
have discussed elsewhere.* The question was of the value of annuities and the 
conclusion I reached was that Ulpian had no vital statistical basis whatever for 
his figures, that he simply began with the capital value the law gave for any 
usufruct and then, realizing that people do die eventually, made some sub- 
tractions, ending with the absurd (vital-statistically speaking) conclusion that 
after the age of 60 the rate of mortality was independent of age. 

There is not, I think, any reason to believe that the practical Romans had 
anticipated Graunt and Petty. 

* Journ. Roy. Stat. Soc. 103 (1940), 246.

That is not to say that nobody studied any demographical problems arithmetically.
Indeed one fellow of the College of Physicians who has had—and will
continue to have—a hundred readers for every one reader of Graunt and Petty
made an elaborate demographical calculation. This was Sir Thomas Browne.
Sir Thomas devoted the sixth chapter of the sixth book of Pseudodoxia to the 
vulgar opinion that the earth was slenderly peopled before the Flood. 

This vulgar opinion Sir Thomas found to be very wide of the mark. Indeed, 
far from the earth being slenderly peopled, ‘we shall rather admire how the 
earth contained its inhabitants, than doubt its inhabitation: and might con- 
ceive the deluge not simply penall, but in some way also necessary, as many 
have conceived of translations, if Adam had not sinned, and the race of man had 
remained upon earth immortal’. Indeed Sir Thomas estimates that by the 
seventh century of the world’s history its population amounted to 1,347,368,420. 
He reaches this result in the following way: 


Having thus declared how powerfully the length of lives conduced unto populosity of 
those times, it will yet be easier acknowledged if we descend to particularities, and consider 
how many in seven hundred years might descend from one man; wherein considering the 
length of their dayes, we may conceive the greatest number to have been alive together. 
And this that no reasonable spirit may contradict, we will declare with manifest dis- 
advantage; for whereas the duration of the world unto the flood was about 1,600 years, we 
will make our compute in less than half that time. Nor will we begin with the first man, 
but allow the earth to be provided of women fit for marriage the second or third first 
centuries; and will only take as granted, that they might beget children at sixty, and at 
an hundred years have twenty, allowing for that number forty years. Nor will we herein 
single out Methuselah, or account from the longest livers, but make choice of the shortest 
of any we find recorded in the Text, excepting Enoch: who after he had lived as many years 
as there be days in the year was translated at 365. And thus from one stock of seven hundred 
years, multiplying still by twenty, we shall find the product to be one thousand, three 
hundred forty seven millions, three hundred sixty eight thousand, four hundred and 
twenty. 

              1.             20.
              2.            400.
              3.          8,000.
Century.      4.        160,000.
              5.      3,200,000.
              6.     64,000,000.
              7.  1,280,000,000.
                  --------------
                  1,347,368,420.
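
Sir Thomas's table is a geometric progression with ratio 20, summed over seven centuries. A minimal Python sketch reproducing it follows; the names are illustrative.

```python
# Each century the number begotten is twenty times that of the century before,
# starting from a single man, as in Sir Thomas Browne's table.
per_century = [20 ** n for n in range(1, 8)]
total = sum(per_century)

for century, number in enumerate(per_century, start=1):
    print(century, f"{number:,}")
print(f"{total:,}")
```

The total includes every century's entry, so it treats all seven generations as alive together.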


Simply as a sum, there are difficulties about this result. If our 20 are equal 
numbers of males and females, it is not 20 which should be multiplied by 20 but 
10. If they are all males, then women are left out of the reckoning. But, per- 
haps, as the Text does not record the ages of women, Sir Thomas esteemed them 
as ephemerids, sufficiently plentiful however to provide a wife for every husband. 
But then I think he should have said that the 20 to be begotten between 60 and 
100 were all males. Anyhow the sum must be wrong because some of the 
64,000,000 short-lived women of the sixth century should survive into the seventh.
Indeed Sir Thomas uses his data a trifle capriciously. 

We must surely play a game according to the rules. We are to accept the
Text word for word as it stands. But, omitting Adam, whose age at his begetting
of Cain is not recorded, and Noah, who seems to have reached middle age— 
500 years—before becoming a father, the reproductive habits of eight fathers are 
recorded. Two begat males at the age of 65, one at 70, one at 90, one at 105, one 
at 162, one at 182 and one at 187. When this primary business was over, they 
are all recorded to have begotten an unspecified number of sons and daughters. 
So, if we are to be faithful to the Text, a very much more complicated arith- 
metical problem presents itself. A male begets another male at an average age 
of about 100, he then begets males and females at an unspecified rate for say 
another 600 years, required the law of increase. The Text does not authorize Sir 
Thomas to start pre-diluvian breeding at 65 or to stop it at 100. His ‘manifest 
disadvantage’ is breaking the rules of the game. 

Further, the Text does not entitle him to predicate of the other males the 
lengths of days and procreative exploits of the recorded eight. 

All this, it may be said, is breaking a butterfly upon the wheel. Nobody now 
takes the statistics of the Authorized Version literally. The point is that Sir 
Thomas Browne did, but used them improperly. As Lord Chesterfield said to a 
Garter King at Arms of his day who had not followed the rules of heraldry, 
‘You foolish man, you don’t know your own foolish business’.

Petty did not tackle pre-diluvian demography, but he did try his hand at an 
estimate of the world’s population after the flood, ‘To justify the Scriptures and 
all other good Histories concerning the Number of the People in Ancient Time’ 
(p. 465). 

As Petty was not going to allow the population of ancient times to be greater 
than in the seventeenth century, but to make it increase regularly from the time 
of Noah’s Ark, common sense saved him from fantastic figures, but not from 
physiological difficulties. The rules of the game obliged him to start with eight 
landed from the Ark, so he thought it best to make them increase and multiply 
very fast indeed at first and progressively more slowly. At first he doubled the 
population every ten years, but by the birth of Christ has brought the period up 
to 1000 years. But doubling every ten years (in the first century from the Flood) 
leads one into difficulties. 

We can allow the possibility of the four pairs emerged from the Ark pro- 
ducing 8 offspring in ten years and so becoming 16 in year 10, without too great 
difficulty. But ten years later they must number 32 and this is a difficulty. If
the fecundity of the first settlers remains the same they will contribute 8 more 
children, giving us a population of 24, the balance of 8 must come from the four 
couples of children all of whom must be under 20, and this is a little difficult. 
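
The difficulty can be put numerically. The sketch below assumes, as in the paragraph above, that the four original couples go on producing 8 children every ten years; the names are illustrative.

```python
# Eight persons (four couples) land from the Ark; the population is to
# double every ten years.
founders = 8
required = [founders * 2 ** k for k in range(3)]   # years 0, 10, 20

# Assumed constant fecundity of the founders: 8 children per decade.
children_per_decade = 8
first_generation = children_per_decade                   # born in years 0-10
with_founders_only = required[1] + children_per_decade   # 24 at year 20

# The shortfall must be supplied by the first generation, none yet 20.
shortfall = required[2] - with_founders_only
couples_among_children = first_generation // 2

print(required)                            # [8, 16, 32]
print(with_founders_only)                  # 24
print(shortfall, couples_among_children)   # 8 4
```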

But at least we may say that there is nothing wholly fantastic in Petty’s 
procedure. Petty does belong to a different arithmetical world from that of 
Browne. Here we may leave purely speculative demography. 

To estimate the people of an area without counting them, we must count
something which has a connexion with the number of the people. We may count
the tax-payers, the houses, the burials, the christenings or the acreage under
corn—all or any of these items vary with the number of people.

I wish to keep separate the discussions of Petty’s and Graunt’s statistical
researches, but in the matter now to be examined Petty used some of Graunt’s
methods and results, so these must be considered.


Graunt used three methods of estimation. In the first place, he surmised
that the number of child-bearing women in a community might be about double
the number of annual births ‘forasmuch as such women, one with another, have
scarce more than one child in two years’. Then he surmised that families were
twice as numerous as women of child-bearing age. His reasoning was that
women between 16 and 76 might be twice as numerous as women between 16
and 40 or 20 and 44 (i.e. of child-bearing age), and he thought of a family
centred round a married couple. Finally he thought that the average family
would consist of eight persons: husband and wife, three children and three
servants or lodgers.

[The remainder of this page is illegible in the source; the text resumes within a
quotation from Graunt.]

[...] the Walls. But forasmuch as there die within the Walls about 3200 per Annum, and in the
whole 13,000, it follows that the housing within the Walls is 1/4 part of the whole, and
consequently, that there are 47,[...] Families in and about London, which agrees well enough
with all my former computations (p. 385).
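
Graunt's first method is a chain of multipliers: child-bearing women are taken as double the annual births, families as double those women, and eight persons are allowed to a family. A sketch in Python follows; the number of births fed into it is a purely hypothetical input, not a figure taken from this text, and the function name is my own.

```python
def graunt_population(annual_births, persons_per_family=8):
    """Population by Graunt's chain of multipliers."""
    child_bearing_women = 2 * annual_births      # one child in two years
    families = 2 * child_bearing_women           # twice the child-bearing women
    return families * persons_per_family

# Hypothetical input, chosen only to show the working.
print(graunt_population(12_000))

# Share of the burials falling within the Walls, from the quotation.
within_walls, whole = 3200, 13_000
print(round(within_walls / whole, 3))    # close to one quarter
```

The whole chain amounts to multiplying the annual births by 32, so any error in the assumed family of eight passes straight into the result.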


These conjectures led Graunt to think that the rate of mortality in London
was about 1 in 32. In his first essay on the growth of London (pp. 458-75) Petty
bases himself upon that estimate and in the series of papers (pp. 505-44) this
remains the fundamental method, but Petty allows himself to modify the
multiplier, not altogether without suspicion of bias. At a quite early stage he
had satisfied himself that London was the largest city in the world and much
larger than Paris. This is the kind of argument. For the three years 1682-84,
the average of burials in London was [...]337 and of Paris 19,887. So if the rates
of mortality were the same, London was larger than Paris.* If the rate of
mortality in Paris were higher than in London then the population of London
must be larger still. According to Petty (a) a greater proportion of the Paris
[...] in hospital, (b) the mortality in hospital was heavier in Paris.

That at London the Hospitals are better and more desirable than those of Paris, for
that in the best at Paris there die 2 out of 15, whereas at London there die out of the worst
scarce 2 of 16, and yet but a fiftieth part of the whole die out of the Hospitals at London,
and [2/5] or 20 times that proportion die out of the Paris Hospitals which are of the same
kind; that is to say, the number of those at London who chuse to lie sick in Hospitals rather
than in their own Houses, are to the like People of Paris as one to twenty; which shows the
greater Poverty or want of Means in the People of Paris than those of London. We infer
from the premisses, viz. the dying scarce 2 of 16 out of the London Hospitals, and about
2 of 15 in the best of Paris (to say nothing of l’hostel Dieu) that either the Physicians and
Chirurgeons of London are better than those of Paris, or that the Air of London is more
wholesome (p. 508).


These, however, are only logical deductions if the user of the hospitals in
London and Paris is identical. If, as implied in the first part of the quotation,
we think of hospitals in the sense which our elder contemporaries think of the
old-fashioned poor law infirmaries, viz. as refuges for the sick poor, it would
mean that in Paris more of the aged indigent died in institutions than in London
[...] mortality might well have nothing to do with the skill or lack of skill
[...] if we think of hospitals in the modern sense [...]
from illnesses which needed special treatment. In any case, [Petty cannot]
have it both ways. In another essay (pp. 510-11) he contrasts the high
[...] deaths to admissions at l’hostel Dieu of Paris with that of la Charité,
[...] the excess in l’hostel Dieu is unnecessary and proceeds to calculate
what the French nation would gain by saving this excess. But he has not inquired
whether the patients of the two institutions were in pari materia.

* It should be remembered that the London of Petty’s calculations is the whole area within
[the Bills]. The calculations of Graunt described above did not include Westminster or the six
out[-parishes of] Surrey and Middlesex which were within the Bills: Islington, Lambeth, Stepney,
Newington, Hackney, Redriff.

Here is an historical problem which might be solved by those familiar with 
the literature of the period. Its discussion would not be relevant here. It is, 
however, only just to Petty to say that, unless conditions deteriorated seriously 
in the following century, his strictures on l’hostel Dieu were justified. In
Franklin’s work (La Vie Privée d’autrefois. L’Hygiène (Paris 1890), pp. 177 et
seq.) an appalling account of this hospital from the pen of the eminent surgeon
Tenon, printed in 1788, is quoted. Tenon’s description of the routine of this
great hospital compares, unfavourably, with the story of the wounded in the
Mesopotamian campaign which horrified England in the war of 1914-18. He
remarks, inter alia, ‘on ne guérissoit point de trépanés autrefois à l’Hôtel-Dieu,
comme on n’en guérit pas encore aujourd’hui’, and cites a court surgeon of the
time of Louis XIV, i.e. a contemporary of Petty, to that effect. His account of 
the treatment of lying-in women is grotesquely horrible. 

In another essay (pp. 533-6) Petty discusses methods of estimation more 
carefully than in his other papers. 

He proposes to show that the population of London (within the Bills) in or 
about 1685 was approximately 696,000. 

There are, he says, three methods: (1) From houses and families. (2) From 
an estimated death rate. (3) From the ratio of those who die of the plague to 
those who escape.

This last we may deal with at once. Petty asserts that Graunt had proved 
that one-fifth of the people died of the plague. But in 1665, 98,000 died of the 
plague; therefore the population was 490,000, and allowing an increase of one- 
third between 1665 and 1686 we reach 653,000. 
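
The plague method reduces to two multiplications. A minimal Python sketch with the figures quoted above follows; the variable names are mine.

```python
from fractions import Fraction

plague_deaths_1665 = 98_000
fraction_dying = Fraction(1, 5)    # the proportion Petty attributes to Graunt

# If one fifth died, the whole was five times the dead.
population_1665 = plague_deaths_1665 / fraction_dying

# Allow an increase of one third between 1665 and 1686.
population_1686 = population_1665 * (1 + Fraction(1, 3))

print(population_1665)           # 490000
print(round(population_1686))    # 653333
```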

Graunt could not have proved that one-fifth of the population died of the 
plague unless he knew what the population was, and he never claimed to have 
done so. 

The other methods (which Graunt used) are rational. 

To estimate houses, Petty used three methods. He says that in the Fire of 
1666, 13,200 houses were burned and that deaths from these houses were one- 
fifth of total deaths, so he reckons the houses to have been 66,000. Then as 
burials in 1686 were to burials in 1666 as 4 to 3, he makes the houses of 1686, 
88,000. He does not, however, say upon what basis the estimate of one-fifth of 
the deaths in 1666 stands. 
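
The first of these house estimates can be retraced as follows. This is a sketch in Python using the figures quoted above; it assumes, with Petty, that houses stand in the same proportion as deaths and as burials.

```python
from fractions import Fraction

houses_burned = 13_200
share_of_deaths = Fraction(1, 5)   # asserted by Petty; its basis is not stated

# If the burned houses supplied one fifth of the deaths, they are taken
# to have been one fifth of all the houses.
houses_1666 = houses_burned / share_of_deaths        # 66,000

# Burials in 1686 stood to those of 1666 as 4 to 3; houses are scaled alike.
houses_1686 = houses_1666 * Fraction(4, 3)           # 88,000

print(houses_1666, houses_1686)
```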

Next, he gives an estimate of the houses in 1682 given him by those employed 
upon a map said to have been made in that year. This map has not been 
identified. 

Lastly, he uses the return of hearths. In Dublin in 1685 the hearths were
29,325 and the houses 6400. In London the hearths were 388,000; so the houses
on the Dublin ratio should be 87,000. In Bristol he says there were 5307 houses
and 16,752 hearths, which give 123,000 houses for London; the mean of the
calculations is 105,000. The Hearth Office itself, he says, certified the number 
to be 105,315. He must now have a multiplier. He accepts Graunt’s multiplier 
of 8 as valid for tradesmen’s families, but allows for smaller families among the 
poor and larger among the rich, finally choosing 6. He then allows for double 
families in houses by adding 10,531 to his 105,315, and multiplying the sum by 
6 has 695,076 for the population. 
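
The hearth calculation may be checked in a few lines. The sketch below is in Python with illustrative names; it recomputes the two ratios from the figures as printed here and then follows Petty's own rounded values. Recomputed in this way the Bristol ratio agrees with his 123,000, while the Dublin ratio comes out somewhat below his 87,000.

```python
# Hearths and houses as quoted by Petty.
dublin_hearths, dublin_houses = 29_325, 6_400
bristol_hearths, bristol_houses = 16_752, 5_307
london_hearths = 388_000

def houses_from_hearths(hearths, ref_hearths, ref_houses):
    """Scale a hearth count by a reference town's hearths per house."""
    return hearths * ref_houses / ref_hearths

by_dublin = houses_from_hearths(london_hearths, dublin_hearths, dublin_houses)
by_bristol = houses_from_hearths(london_hearths, bristol_hearths, bristol_houses)
print(round(by_dublin), round(by_bristol))

# Petty's own rounded figures and their mean.
petty_mean = (87_000 + 123_000) / 2

# Population: certified houses, plus 10,531 for double families (one tenth
# of the certified number, rounded down), multiplied by 6.
certified_houses = 105_315
double_families = certified_houses // 10
population = (certified_houses + double_families) * 6
print(petty_mean, population)
```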

Petty’s second way was from an estimated death rate. 

Petty multiplies the average of the burials in 1684 and 1685 (23,212) by 30, 
which makes the population 696,360. 

He now essays to prove that the death rate in London was 1 in 30. He uses 
four arguments, of which only one is strictly to the point, viz. Graunt’s direct 
observation that three deaths occur annually in eleven families—which however
involves the assumption of eight persons to the families observed. Two others
are relevant, viz. observations, apparently direct, that in ‘healthful places’ the
mortality is 1 in 50 and in nine country parishes 1 in 37. The fourth partly rests
upon a statement which Graunt did not make, viz. that one of 20 children under 
10 dies annually. This fictitious value Petty averages with the statement of a 
M. Auzout to the effect that the rate of mortality of adults in Rome is 1 in 40. 
It will be clear that Petty has proved nothing at all. What he has done is to make 
it unlikely that the rate of mortality was less than 1 in 30. That, perhaps, was 
enough. One has a certain sympathy with his round statement: ‘Till I see
another round number, grounded upon many observations, nearer than 30,
I hope to have done pretty well in multiplying our Burials by 30 to find the
number of the People.’
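
The second way, and the one argument strictly to the point, can both be checked directly. The following Python sketch uses the figures quoted above; the names are illustrative.

```python
from fractions import Fraction

# Population from burials: mean annual burials of 1684 and 1685 times 30.
mean_burials = 23_212
population = mean_burials * 30
print(population)            # 696360

# Graunt's observation: 3 deaths a year in 11 families, taken at 8 persons each.
deaths, families, persons_per_family = 3, 11, 8
rate = Fraction(deaths, families * persons_per_family)
print(rate, float(1 / rate))   # 3/88, i.e. about 1 in 29.3
```

The rate of about 1 in 29 depends entirely on the assumed eight persons to a family; with six to a family it would be 1 in 22.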


With this I may conclude the analysis of Petty’s statistical work. It will, 
I think, soon be clear enough that it is not of the calibre of Graunt’s. Yet I 
cannot take leave of it without something of an ave. Careless, happy-go-lucky, 
tendentious; yes, all that. But anybody who has felt the exhilaration, to which 
Francis Galton owned, in the doing of sums concerning biological problems, feels
his heart warmed by the arithmetical knight errant who had so many statistical
adventures. 


(To be continued)











FIDUCIAL ARGUMENT AND THE THEORY OF 
CONFIDENCE INTERVALS 


By J. NEYMAN 
University of California, Berkeley, California 


CONTENTS

1. Introduction
2. Basic ideas in the theory of confidence intervals
3. Necessary and sufficient conditions for a pair of functions to be confidence limits
4. Differences between the theory of confidence intervals and the theory of fiducial
   argument
   (i) Evidence of [...] differences between the two theories
   (ii) [...] differences between the two theories
5. Views of M. S. Bartlett and R. A. Fisher
6. Summary
7. References


1. INTRODUCTION 


The theory of confidence intervals was started by the present author about 1930.
At that time it was taught in lectures given both at the University and at the
Central College of Agriculture, Warsaw, Poland. The theory found immediate
practical applications, and before any theoretical paper was published, a booklet
(Pytkowski, 1932) appeared giving numerical confidence intervals for means and
for regression coefficients. The term ‘confidence interval’ is a translation of the
original Polish ‘przedział ufności’. The author’s theoretical results appeared two
years later (Neyman, 1934). At almost the same time the first tables and graphs
of confidence intervals were published (Clopper & Pearson, 1934) in a paper which
gave a remarkably clear explanation of the difference between the new approach
to the problem of estimation and the old one, by means of Bayes’s theorem.
The first publication on fiducial argument (Fisher, 1930) anticipated the booklet
of Pytkowski by two years. The present author overlooked this article for some
time. However, when preparing his paper of 1934, he was already acquainted with
it and also with the next paper (Fisher, 1933) on a similar subject. Although
Fisher’s method of approach was entirely different from the author’s, the
numerical identity of Fisher’s fiducial limits with the confidence limits in the
author’s theory, and also some of Fisher’s early comments, suggested to the author
that the two theories are essentially the same. Accordingly, and owing to the
difference in dates of publications, the author considered his own work as an
extension of the previous results of Fisher. This was clearly stated in the author’s
paper of 1934.

Apart from the above points of agreement the author had found certain 
passages and conceptions in the publications of Fisher which were difficult for 
him to understand and to reconcile with what was essential in the theory of con- 
fidence intervals. They included ‘fiducial probability’ and ‘fiducial distribution 
of a parameter’. However, the author was inclined to think that these were, more 
or less, lapsus linguae, difficult to avoid in the early stages of a new theory. This 
attitude was clearly expressed in the paper of 1934. That paper was read before 
a meeting of the Royal Statistical Society and was followed by a public discussion 
recorded in the Society’s Journal. Fisher took part in the discussion, and it was 
a great surprise to the author to find that, far from recognizing them as
misunderstandings, he considered fiducial probability and fiducial distributions as
absolutely essential parts of his theory. As a result, the author began to doubt
whether the two theories were, in fact, equivalent. These doubts were only
increased by Fisher’s insistence that the calculation of fiducial distributions and
fiducial limits must be limited to cases where sufficient statistics exist (Fisher,
1936), and by his warnings against inconsistencies in the theory of confidence
intervals.

When questioned on the subject, the author could not conceal his doubts and 
they were published (Neyman, 1938a). Subsequent publications by other authors 
appear to be divided. Some, e.g. the very important papers by Wald (1939) and 
by Wald & Wolfowitz (1939), deal with the theory of confidence intervals, entirely 
ignoring fiducial theory. Others (Starkey, 1938; Sukhatme, 1938; Yates, 1939), 
at the other extreme, work on the ground of fiducial argument and ignore the 
confidence intervals. There is also an intermediate group of authors with an almost 
continuous spectrum of opinions. Pitman (1939), in a very interesting paper on 
estimation of location and scale parameters, states that the two theories ‘are 
essentially the same and that their two points of view are both necessary for a full 
comprehension of the theory of estimation’. And a few pages further: ‘I at first
called it the fiducial probability function, but finally decided to shorten the name
by dropping the word “probability”’.

Next we find the statement (Bartlett, 1939) that ‘by a distribution of fiducial
type we shall mean a distribution providing at least confidence intervals in the
sense of Neyman’. This statement is used in an argument (Bartlett, 1936, 1939)
that, as a distribution deduced by Fisher (1936) does not seem to provide confidence
limits, there must be some error in the deduction. A similar point of view,
but with a stronger leaning towards confidence intervals, is expressed by Welch
(1939). In this paper various general claims of Fisher are analysed, essentially
from the point of view of confidence intervals, and tested on appropriate examples.
Among other things it is found that the fears of inconsistencies in the theory of
confidence intervals are unfounded.










A quite different school of thought is represented by Jeffreys (1940), according 
to which the fiducial approach to the problem of estimation is completely equi- 
valent with that by inverse probability. 

Fisher (1937, 1939a, 1939b) and Yates (1939) emphatically deny that there
is an error in Fisher’s paper of 1936. On the contrary, it is said that the results 
then published were obscured by the controversy arising from Bartlett’s con- 
fusion about the nature of fiducial argument. Also, especially in earlier papers 
(1930, 1933, 1936), Fisher is equally emphatic on the distinction between the 
fiducial and the inverse probability approaches to the problem of estimation. 

The above survey shows that there is an interesting divergence of opinions as 
to what is essential in the fiducial theory in general and as to whether it is in any 
way connected with the theory of confidence intervals. The perusal of all the 
literature quoted does not allow the present author to form any precise opinion 
as to the first of these questions. On the other hand, there now seems to be sufficient 
ground for answering the second, concerning the relationship between the two 
theories. The purpose of the present paper is to show that there is none. The 
relevant points concerning this question, which were possible to establish on the 
ground of earlier literature, are explained in excellent papers by Pearson (1939) 
and Welch (1939), with the final conclusion that, in spite of various differences, 
the two theories are closely related. However, fresh evidence provided by papers 
of Fisher (1939a, 1939b) and Yates (1939) shows that no such relation exists and
that the authors suspecting it were misled by the incompleteness of earlier writings 
concerning fiducial argument. 

As a result of the present paper it may be found expedient, for the sake of 
clarity, to avoid confusion of terminologies appropriate to the two theories. 
Instead of writing, as some authors do, on ‘fiducial or confidence’ limits, it may be
preferable to discuss ‘fiducial limits’ or ‘confidence limits’, as the case may be,
separately.


2. BASIC IDEAS IN THE THEORY OF CONFIDENCE INTERVALS


The key to understanding the theory of confidence intervals is in being clear 
about what might be called the classical point of view in the theory of probability. 
This theory was originally built up to answer questions about how frequently a 
given combination of throws will occur in a long series of games of dice. Thus, the
probability of a certain combination found to be, say, 1/5, implies that this
combination would appear in about 20% of a long series of actual games. This
agreement may, but need not, be observed. In the latter case, we would say that the
assumptions underlying the deduction were not realized by the actual experiments.
The dice used were perhaps ‘biased’, and so forth. The point is that, whenever
it is said that a given set of probabilities does refer to some phenomena, then
it is understood that the relative frequencies of various aspects of the phenomena,
in a long series of trials, are approximately equal to corresponding probabilities.
This is just what the author calls the classical point of view in the theory of
probability. It is excellently explained by v. Mises (1939), but is more general than
the definition of probability adopted by that author.*

Apart from the classical point of view on probability, there is another. It 
considers the probabilities as measures of rational belief in the truth of a given 
proposition. Here the agreement between the probability and some relative 
frequency is not essential. 

The theory of confidence intervals was built up to give a solution of problems
of estimation which would have a clear frequency interpretation, characteristic
of the classical point of view. Consider a set $E$ of $n$ observable random variables,
$x_1, \ldots, x_n$, and assume as given that the function $p(E \mid \theta_1, \theta_2, \ldots, \theta_s)$
represents its elementary probability law. Here $\theta_1, \ldots, \theta_s$ represent certain
parameters whose values are unknown.

The above should be interpreted as follows. There are some actual trials $T$
which are able to determine the values of the $x$’s. There are also some numbers
$\vartheta_1, \vartheta_2, \ldots, \vartheta_s$, unknown to us, such that, whatever be a region $w$ in the space of
the $x$’s, the integral of $p(E \mid \vartheta_1, \vartheta_2, \ldots, \vartheta_s)$ taken over this region is approximately
equal to the relative frequency with which the point $E$, as determined by the
trials $T$, falls within that region $w$. The problem of estimating one of the
parameters, e.g. $\theta_1$, consists in using just one system of the $x$’s as determined by the
trials $T$ to calculate $\vartheta_1$ approximately. Alternatively, it may consist in calculating
an interval $(a, a+d)$ which ‘presumably’ covers $\vartheta_1$.


The original approach to this problem is based on Bayes’s theorem. Denote
by $p(\theta_1, \theta_2, \ldots, \theta_s)$ the elementary probability law of the $\theta$’s. Then

$$p(\theta_1, \ldots, \theta_s \mid E') = \frac{p(\theta_1, \ldots, \theta_s)\, p(E' \mid \theta_1, \ldots, \theta_s)}{\int \cdots \int p(\theta_1, \ldots, \theta_s)\, p(E' \mid \theta_1, \ldots, \theta_s)\, d\theta_1 \ldots d\theta_s} \tag{1}$$

will be the relative probability law, or the probability law a posteriori of all the
$\theta$’s given the observed system $E'$ of the values of the $x$’s. It can be used to calculate
the most probable value of $\theta_1$. Alternatively, given a number $d > 0$, the law can be
used to find the interval $(a, a+d)$ such that the a posteriori probability
$P\{a + d > \theta_1 > a \mid E'\}$ is greatest.
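The calculation described by formula (1) can be sketched numerically. The block below is only an illustration under assumptions that are not in the text: a single parameter, a hypothetical binomial experiment (7 successes in 10 trials), a uniform a priori law, and a grid in place of the integral. The function names are invented for the example.

```python
from math import comb

def posterior_on_grid(prior, likelihood, grid):
    """Formula (1) on a grid: weight proportional to prior times likelihood."""
    weights = [prior(t) * likelihood(t) for t in grid]
    total = sum(weights)  # discrete stand-in for the integral in the denominator of (1)
    return [w / total for w in weights]

def best_interval(posterior, grid_size, d):
    """Return (a, probability) for the interval (a, a + d) of greatest posterior probability."""
    width = round(d * grid_size)  # number of grid cells spanned by an interval of length d
    prefix = [0.0]
    for p in posterior:
        prefix.append(prefix[-1] + p)
    best_start = max(range(grid_size - width + 1),
                     key=lambda k: prefix[k + width] - prefix[k])
    probability = prefix[best_start + width] - prefix[best_start]
    return best_start / grid_size, probability

# Hypothetical data and a priori law, chosen only for illustration.
GRID_SIZE = 1000
D = 0.2
grid = [(k + 0.5) / GRID_SIZE for k in range(GRID_SIZE)]  # cell midpoints on (0, 1)
trials, successes = 10, 7
posterior = posterior_on_grid(
    prior=lambda t: 1.0,
    likelihood=lambda t: comb(trials, successes) * t**successes * (1 - t)**(trials - successes),
    grid=grid,
)
a, prob = best_interval(posterior, GRID_SIZE, D)
print(f"interval ({a:.3f}, {a + D:.3f}) has a posteriori probability {prob:.3f}")
```

Whether the probability so obtained corresponds to any observable frequency depends entirely on whether the a priori law is implied by the problem, which is the question taken up next.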

Our attitude towards this kind of solution, dictated by the classical point of 
view on probability, depends on circumstances and may be twofold. 

The circumstances of the problem may imply not only that the $x$’s but also that
the $\theta$’s are random variables and that the function $p(\theta_1, \ldots, \theta_s)$ could be used to
calculate the relative frequencies of various combinations of values of the $\theta$’s.
Such situations are rare, but they do occasionally occur, especially in problems of
genetics and of mass production. If the function $p(\theta_1, \ldots, \theta_s)$ is implied by the
problem considered, then the probability $P\{a + d > \theta_1 > a \mid E'\}$ has a clear
frequency interpretation, as follows. Imagine a long sequence, $S$, of cases where the
$\theta$’s vary according to the above law and the $x$’s are determined by the particular
trials considered. Pick from this sequence $S$ a subsequence $S(E')$ of such trials
in which the experiments determined the same system of values of the $x$’s, namely
the system $E'$. Naturally, the value of $\theta_1$ in cases belonging to $S(E')$ would vary.
But, if the functions $p(E \mid \theta_1, \ldots, \theta_s)$ and $p(\theta_1, \ldots, \theta_s)$ do have the presumed relation
to the trials considered, it will be found that among all the intervals of length $d$,
the interval $(a, a+d)$ will contain the value of $\theta_1$ more frequently than any other,
and that this frequency will be approximately equal to $P\{a + d > \theta_1 > a \mid E'\}$. It
follows that, if the function $p(\theta_1, \ldots, \theta_s)$ is implied by the circumstances of the
problem of estimation, the use of the formula (1) is perfectly legitimate from the
point of view of the classical theory of probability.

* It will be noticed that the classical point of view on probability does not imply any particular
definition of that concept. It is not suggested that the one adopted by v. Mises is the only one
that could be consistently used.


The situation is quite different when the circumstances of the problem do not
imply the a priori probability law. This is most frequently the case. Moreover,
usually there are serious difficulties in considering the $\theta$’s as random variables.
Jeffreys (1939) advises the use of formula (1) also in such cases, with a function
$p(\theta_1, \ldots, \theta_s)$ invented for the purpose. He claims that the conclusions drawn in
this way are valid, provided that the function used is just the one that he suggests.
The present author would not question this statement on condition that the word
‘valid’, or any other such description, is not given any significance beyond that
described above. In other words, there seems to be no reason why we should not
agree to call the above conclusions ‘valid in the sense of Jeffreys’. On the other
hand, it seems essential to be clear that any probability calculated from (1), with
any function $p(\theta_1, \ldots, \theta_s)$ not implied by the actual problem, need not and,
generally, will not have any relation to relative frequencies. It will not be the
probability in the classical sense of the word and, therefore, persons who would
like to deal only with classical probabilities, having their counterparts in the
really observable frequencies, are forced to look for a solution of the problem of
estimation other than by means of the theorem of Bayes.


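This solution (Neyman, 1937, 1938b) may be obtained as follows. Consider
the case where the circumstances imply that the $x$’s forming the set $E$ are
random variables with the probability law $p(E \mid \theta_1, \ldots, \theta_s)$.
Denote by $\underline{\theta}(E)$ and $\bar{\theta}(E)$ two functions of the $x$’s. Obviously
these functions will also be random variables.

DEFINITION 1. If the functions $\underline{\theta}(E)$ and $\bar{\theta}(E)$ possess the property that,
whatever be the possible value $\vartheta_1$ of $\theta_1$ and whatever be the values of the unknown
parameters $\theta_2, \theta_3, \ldots, \theta_s$, the probability

$$P\{\underline{\theta}(E) \le \vartheta_1 \le \bar{\theta}(E) \mid \vartheta_1, \theta_2, \ldots, \theta_s\} \equiv \alpha, \tag{2}$$

then we will say that the functions $\underline{\theta}(E)$ and $\bar{\theta}(E)$ are the lower and the upper
confidence limits of $\theta_1$, corresponding to the confidence coefficient $\alpha$. The interval
$(\underline{\theta}(E), \bar{\theta}(E))$ is called the confidence interval for $\theta_1$.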


In spite of the complete simplicity of the above definition, certain persons have
difficulties in following it. These difficulties seem to be due to what Karl Pearson
(1938) used to call routine of thought. In the present case the routine was
established by a century and a half of continuous work with Bayes’s theorem. It may
be useful, therefore, to give a few illustrations.

Assume that $s = 2$, that $\theta_1$ may have only the five values 1, 2, 3, 4, and 5, and
that, at the same time, $\theta_2$ may vary continuously between zero and 1. To satisfy
Definition 1, the only requirement on the functions $\underline{\theta}(E)$ and $\bar{\theta}(E)$ is

$$P\{\underline{\theta}(E) \le \vartheta_1 \le \bar{\theta}(E) \mid \vartheta_1, \theta_2\} = \alpha \tag{3}$$

for all values of $\vartheta_1 = 1, 2, 3, 4,$ and 5, and for $\theta_2$ varying between 0 and 1. The
probabilities (2) and (3) are not the probabilities of $\theta_1$ falling within any
limits. On the contrary, they are the probabilities of the functions $\underline{\theta}(E)$
and $\bar{\theta}(E)$ falling on both sides of a specified number $\vartheta_1$. These probabilities are to be
calculated from the given function $p(E \mid \theta_1, \theta_2)$ with the value of $\theta_1$ set equal to the same
number $\vartheta_1$.

[...]



seems to denote the mental process leading to knowledge. As such, it can only be
deductive. Therefore, the description ‘inductive’ seems to exclude both the
‘reasoning’ and also its final step, the ‘conclusion’. If we wish to use the word
‘inductive’ to describe the results of statistical inquiries, then we should apply it
to ‘behaviour’ and not to ‘reasoning’. The fact that a given pair of functions $\underline{\theta}(E)$
and $\bar{\theta}(E)$ satisfies the identity (2) may be ‘deduced’ from the properties of the
function $p(E \mid \theta_1, \ldots, \theta_s)$. Earlier trials may show characteristics in the empirical
distribution of the $x$’s which seem in agreement with the function $p(E \mid \theta_1, \ldots, \theta_s)$.
On these grounds, after observing the values of the $x$’s in a case where the $\theta$’s are
unknown and calculating $\underline{\theta}(E')$ and $\bar{\theta}(E')$, we may decide to behave as if we actually
knew that the true value $\vartheta_1$ of $\theta_1$ were between $\underline{\theta}(E')$ and $\bar{\theta}(E')$. This is done as a
result of our decision and has nothing to do with ‘reasoning’ or ‘conclusion’. The
reasoning ended when the functions $\underline{\theta}(E)$ and $\bar{\theta}(E)$ were calculated. The above
process is also devoid of any ‘belief’ concerning the value $\vartheta_1$ of $\theta_1$. Occasionally we do
not behave in accordance with our beliefs. Such, for example, is the case when we
take out an accident insurance policy while preparing for a vacation trip. In doing
so, we surely act against our firm belief that there will be no accident; otherwise,
we would probably stay at home. This is an example of inductive behaviour.

Obviously, if there are many different pairs of functions, $\underline{\theta}(E)$ and $\bar{\theta}(E)$, all
corresponding to the same $\alpha$, our choice of the one to use must be based on the
detailed study of their properties. For example, if it appears that the difference
between one pair, $\bar{\theta}_1(E) - \underline{\theta}_1(E)$, is always (or most frequently) smaller than that
between some other pair, then we would probably prefer to use the first. The
problem of determining the confidence limits and of studying their properties
forms the subject of the theory of confidence intervals.


3. NECESSARY AND SUFFICIENT CONDITIONS FOR A PAIR OF 
FUNCTIONS TO BE CONFIDENCE LIMITS 
Let $a(E) \le b(E)$ be any two single-valued functions of the $x$’s determined for
all possible systems of their values. Denote by $W$ the space of the $x$’s and by $\vartheta_1$
one of the possible values of $\theta_1$. Finally, let $A(\vartheta_1)$ denote the region in the space
$W$ composed of all points $E$ which satisfy the double inequality,

$$a(E) \le \vartheta_1 \le b(E). \tag{4}$$

It was proved (Neyman, 1937) that for the two functions, $a(E)$ and $b(E)$, to be the
lower and upper confidence limits for the parameter $\theta_1$, it is necessary and sufficient
that, whatever be the possible value $\vartheta_1$ of $\theta_1$, the probability

$$P\{E \in A(\vartheta_1) \mid \theta_1 = \vartheta_1\} \equiv \alpha. \tag{5}$$

The identity refers to the arbitrary variation of $\theta_2, \ldots, \theta_s$.
This condition will be used below to show that a certain pair of functions does
not represent the confidence limits. For this purpose, the following steps will be
taken: We shall select a convenient value $\vartheta_1$ of the estimated parameter $\theta_1$ and
determine the region $A(\vartheta_1)$ as in (4). Next, we shall substitute this same value $\vartheta_1$
instead of the parameter $\theta_1$ in the elementary probability law of the variables
considered, getting $p(E \mid \vartheta_1, \theta_2, \ldots, \theta_s)$. This last function will be integrated over
$A(\vartheta_1)$ to find the probability $P\{E \in A(\vartheta_1) \mid \theta_1 = \vartheta_1\}$ as in the left-hand side of (5). But
this integral will be dependent on the values of the other parameters involved,
showing that the identity (5) is not satisfied. The conclusion will be that the
particular functions considered are not confidence limits.
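The steps just listed can be tried on a deliberately simple case. The example below is not taken from the paper: it assumes a normal sample of size $n$ with unknown mean $\theta_1$ and unknown standard deviation $\theta_2 = \sigma$, and a hypothetical pair of functions $a(E) = \bar{x} - 1.96/\sqrt{n}$, $b(E) = \bar{x} + 1.96/\sqrt{n}$, which silently treats $\sigma$ as equal to 1. For this pair the probability on the left-hand side of (5) has a closed form, so no sampling is needed.

```python
from statistics import NormalDist

def coverage_probability(sigma, n, assumed_sigma=1.0, z=1.96):
    """P{E in A(theta_1) | theta_1} for the hypothetical limits xbar -/+ z*assumed_sigma/sqrt(n).

    The sample mean is normal about theta_1 with standard error sigma/sqrt(n),
    so the result does not depend on theta_1, only on the other parameter sigma.
    """
    half_width = z * assumed_sigma / n ** 0.5
    standard_error = sigma / n ** 0.5
    return 2 * NormalDist().cdf(half_width / standard_error) - 1

# Vary the parameter that is not being estimated and watch the probability move.
for sigma in (0.5, 1.0, 2.0):
    print(f"sigma = {sigma}: probability = {coverage_probability(sigma, n=10):.4f}")
```

The probability equals 0.95 only at $\sigma = 1$ and changes with $\sigma$ elsewhere, so the identity (5) fails and this pair of functions is not a pair of confidence limits for $\theta_1$.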


4. DIFFERENCES BETWEEN THE THEORY OF CONFIDENCE INTERVALS
AND THE THEORY OF FIDUCIAL ARGUMENT 


In this section we will consider examples treated both from the point of view
of confidence intervals and of fiducial argument. These will be selected to illustrate
both the conceptual and the numerical differences between the two theories.

(i) Evidence of conceptual differences between the two theories. The first results
obtained concerning confidence intervals (Neyman, 1934) refer to the case where
all the $n$ observable variables $x_i$ are mutually independent, normally distributed,
have the same though unknown standard error, $\sigma$, and expectations $\mathcal{E}(x_i)$ which
are linearly connected with some $s < n$ unknown parameters $p_1, p_2, \ldots, p_s$, so that

$$\mathcal{E}(x_i) = a_{i1}p_1 + a_{i2}p_2 + \ldots + a_{is}p_s. \tag{6}$$

Here the $a$’s are supposed to be known and to form a non-singular matrix. Denote
by $\theta$ any linear combination of the same $p$’s, that is

$$\theta = b_1p_1 + b_2p_2 + \ldots + b_sp_s, \tag{7}$$

with known $b$’s not all equal to zero. In these circumstances, a confidence interval
for $\theta$ is given by

$$F - St_\alpha \le \theta \le F + St_\alpha, \tag{8}$$

where $F$ denotes the best unbiased estimate of $\theta$ (David & Neyman, 1938), $S$ the
estimate of the standard error of $F$, and $t_\alpha$ the value of the ‘Student’-Fisher $t$
corresponding to the number of degrees of freedom $n - s$ and to $P = 1 - \alpha$. The
application of more recent theory (Neyman, 1935b) shows that the confidence
intervals (8) have distinct advantages over any others by satisfying the definition
(Neyman, 1937) of the ‘short unbiased system of type $B_1$’. Without entering into
these details, we shall consider the particular case where $s = 1$, $a_{i1} = 1$ and
$b_1 = 1$. This will be the case if all the $x$’s come from the same unknown normal
population and it is desired to estimate its mean, $\theta = \mathcal{E}(x_i)$. In that case $F = \bar{x}$
and

$$S^2 = \frac{1}{n(n-1)} \Sigma (x_i - \bar{x})^2. \tag{9}$$

As mentioned, the general confidence interval (8) was discussed in lectures about 
1930, and in 1932 a publication appeared using the concept and the formula (8). 

As far as is known, the first full discussion of the corresponding result in the 
fiducial theory was given by Fisher a few years later (Fisher, 1935, 1936), and 
here is the relevant passage from the second paper. 













If a sample of $n$ observations, $x_1, \ldots, x_n$, has been drawn from a normal population
having a mean value $\mu$, and if from the sample we calculate the two statistics $\bar{x} = \Sigma x_i/n$ and
$s^2 = \Sigma(x_i - \bar{x})^2/(n-1)$, ..., ‘Student’ has shown (1925)* that the quantity $t$, defined by the
equation

$$t = \frac{(\bar{x} - \mu)\sqrt{n}}{s}, \tag{10}$$

is distributed in different samples in a distribution dependent only from the size of the
sample, $n$. It is possible, therefore, to calculate, for each value of $n$, what value of $t$ will be
exceeded with any assigned frequency, $P$, such as 1% or 5%. These values of $t$ are, in fact,
available in existing tables (Fisher, 1925-34).

It must now be noticed that $t$ is a continuous function of the unknown parameter, the
mean, together with observable values, $\bar{x}$, $s$ and $n$, only. Consequently the inequality
$t > t_1$ is equivalent to the inequality

$$\mu < \bar{x} - st_1/\sqrt{n}, \tag{11}$$

so that this last inequality must be satisfied with the same probability as the first. This
probability is known for all values of $t_1$, and decreases continuously as $t_1$ is increased. Since,
therefore, the right-hand side of the inequality takes, by varying $t_1$, all real values, we may
state the probability that $\mu$ is less than any assigned value, or the probability that it lies
between any assigned values, or, in short, its probability distribution, in the light of the
sample observed.

It is of some importance to distinguish such probability statements about the value of
$\mu$, from those that would be derived by the method of inverse probability, from any
postulated knowledge of the distribution of $\mu$ in the different populations which might have
been sampled....To distinguish it from any of the inverse probability distributions
derivable from the data it has been termed the fiducial probability distribution, and the
probability statements which it embraces are termed statements of fiducial probability.

In the next section we shall analyse the above passage in detail and show
exactly where and how it conflicts with the classical theory of probability and
thus with the theory of confidence intervals. Here we will mention only that it is
ambiguous. Just this kind of ambiguity, which is also found in the earlier papers
(Fisher, 1930, 1933), is probably responsible for a number of authors, including
the present one, thinking that the fiducial theory and the theory of confidence
intervals are linked.

In a few years it was found necessary to reinterpret formula (11). This was
done by Fisher (1939b) and, somewhat more clearly but on the same lines,
by Yates (1939). It will be seen from the following quotation from Yates’s paper
that the above passage by Fisher certainly does not contain everything which is
now considered essential in the fiducial theory and that the presumption of any
link between the latter and the theory of confidence intervals is unfounded.
Yates’s more relevant sentences are italicized by the present author.

While explaining the meaning of the fiducial distribution of the mean $\mu$ of a
normal population, Yates mentions that the fiducial distribution of $\sigma^2$ is given by

$$\sigma^2 = \frac{(n-1)s^2}{\chi^2}, \tag{12}$$

where $\chi^2$ has its usual distribution with $n - 1$ degrees of freedom.


* Actually, of course, this result appeared earlier (‘Student’, 1908). 




It can then be shown that, for a value of $\mu$ equal to $\mu_1$, and a given $s$, the value of $\bar{x}$ in
subsequent samples would be as small as that observed in a fraction $\epsilon$ of the samples,
provided that the actual distribution of $\sigma^2$ is the same as the fiducial distribution given above.

In this form, however, the statement is open to objection on the ground that in
subsequent samples $\sigma$ may in fact be distributed in any manner, and that $s$ will certainly vary
from sample to sample. To avoid this objection we must frankly recognize that we have here
introduced a new concept into our methods of inductive inference, which cannot be deduced by
the rules of logic from already accepted methods....That is...the form of fiducial statement
which is implicit in the $t$ test as ordinarily used by practical experimenters. ...It must be
recognized as essentially different from the statement that $t$ will exceed $t_1$ in a fraction $\epsilon$
of all experiments. The latter is true for any given fixed $\sigma$ or any set of $\sigma$’s. The former
(i.e. the fiducial statement, J.N.) is true for a given $s$ when $\sigma$ is taken to be fiducially
distributed in the appropriate distribution....The logical difference between the two approaches
(fiducial and inverse probability, J.N.) should, however, be recognized. The approach by
inverse probability enables fiducial statements about $\mu$ to be derived from the classical
theory of probability, without the introduction of any new principle, but only at the cost of
postulating a particular a priori distribution of $\sigma$. In the fiducial approach such a priori
postulation is regarded as inadmissible, but in order to discard it a new principle, that of
utilizing the fiducial distribution of $\sigma$, must be introduced ....Once the principle is accepted it
is possible, given $\bar{x}$ and $s$, to make formal and exact statements of the fiducial type about $\mu$
which are independent of all prior knowledge of $\sigma$. If the principle is not accepted, then it
appears that we must either assume an a priori distribution of $\sigma$, or deny that there is any
possibility of making fiducial statements about $\mu$.


The present author is unable to understand the exact meaning of what is
called ‘fiducial statements about $\mu$’. However, his conclusion is that their
conceptual nature must be quite different from that dealt with in the theory of
confidence intervals. This conclusion is based on the fact that all the difficulties
described by Yates as inherent in the fiducial theory are non-existent in the theory
of confidence intervals. Applications of the latter require no new principle ‘which
cannot be deduced by the rules of logic’, no assumption that this or that unknown
parameter follows any specified distribution, and have no connexion with Bayes’s
theorem. To make the situation absolutely clear, imagine a sequence of normal
populations $\pi_1, \pi_2, \ldots, \pi_m, \ldots$, with their means $\theta_1, \theta_2, \ldots, \theta_m, \ldots$ and their
standard deviations $\sigma_1, \sigma_2, \ldots, \sigma_m, \ldots$. Imagine that out of each population $\pi_m$ we
have a random sample $E_m$ of $n$ individuals, with its mean $\bar{x}_m$ and an estimate of
the corresponding variance $S_m^2$, as in (9). The theory of confidence intervals
guarantees that the relative frequency with which $\bar{x}_m - t_\alpha S_m$ will fall short of the
corresponding $\theta_m$ and, at the same time, $\bar{x}_m + t_\alpha S_m$ will exceed this same number
$\theta_m$, will be, within an error of sampling, equal to $\alpha$. An incredulous reader may
easily check this by a sampling experiment. In this he will be at liberty to keep
$\theta_m$ and/or $\sigma_m$ constant, or to vary them at his pleasure, without any restriction.
Of course, the distributions of the populations sampled should be more or less
normal and the sampling should be random. It follows from the above passages
of Yates that if the requirements above are satisfied and no new principles
accepted, then we have to deny that there is any possibility of making fiducial
statements about $\theta_m$. If so, then the nature of the latter is different from those
involved in the application of the theory of confidence intervals.
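The sampling experiment suggested to the incredulous reader is easy to carry out by machine. The sketch below is one possible version, with choices that are not in the text: samples of $n = 10$, a confidence coefficient of 0.95 (so that $t_\alpha = 2.262$, the tabulated two-sided 5% point for 9 degrees of freedom), and means and standard deviations redrawn arbitrarily for every population.

```python
import random
from math import sqrt

T_ALPHA = 2.262  # tabulated two-sided 5% point of Student's t, 9 degrees of freedom

def interval_covers(sample, theta, t_alpha):
    """True when mean - t*S <= theta <= mean + t*S, with S^2 computed as in (9)."""
    n = len(sample)
    mean = sum(sample) / n
    s = sqrt(sum((x - mean) ** 2 for x in sample) / (n * (n - 1)))
    return mean - t_alpha * s <= theta <= mean + t_alpha * s

def coverage_frequency(trials=20_000, n=10, seed=1941):
    """Relative frequency of correct statements over a sequence of populations."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        theta = rng.uniform(-100.0, 100.0)  # mean of this population, varied at pleasure
        sigma = rng.uniform(0.1, 50.0)      # its standard deviation, likewise arbitrary
        sample = [rng.gauss(theta, sigma) for _ in range(n)]
        hits += interval_covers(sample, theta, T_ALPHA)
    return hits / trials

print(f"relative frequency of covering intervals: {coverage_frequency():.3f}")
```

The printed frequency should agree with 0.95 within the error of sampling, and replacing the two `uniform` draws by constants, or by any other rule, should leave that agreement intact.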




The comparison of the above comments by Yates with those of Fisher gives 
a curious impression. Where Yates sees so many difficulties and restrictions, 
Fisher mentions none. Yet this very publication of Yates is fully endorsed by 
Fisher (1939). 

(ii) Numerical differences between the two theories. Besides establishing the 
existence of conceptual differences, it is essential to show that the two theories 
may give different numerical results. We may conclude from the discussion above 
that the application of confidence intervals requires fewer restrictions. But there 
is a logical possibility that, when both theories are applicable, they give the same 
numerical result. The following example shows that this is not the case and that 
fiducial limits need not satisfy the definition of confidence limits. 

The example that we are going to discuss refers to the problem of estimating
the difference, say $\delta$, between the means of two populations of which it is known
only that both are normal. Denote by

$$\left.\begin{array}{l} x_{11}, x_{12}, \ldots, x_{1n} \\ x_{21}, x_{22}, \ldots, x_{2n'} \end{array}\right\} \tag{13}$$

two random samples to be drawn from these populations and let $n \le n'$. The
confidence limits for $\delta$ have been very elegantly obtained by Bartlett. He did not
publish his results himself but they are briefly mentioned in a paper by Welch
(1938). The tendency towards a greater generality of presentation resulted in
certain complications. The following is a less general but simplified statement of
the results.* Assume that the $x$’s in (13) are numbered in the order in which they
will be given by observation. Otherwise, randomize the second series. Next
calculate $n$ differences

$$u_i = x_{1i} - x_{2i}, \quad (i = 1, 2, \ldots, n). \tag{14}$$

If $\mathcal{E}(x_{1i}) = \theta + \delta$ and $\mathcal{E}(x_{2i}) = \theta$, then $\mathcal{E}(u_i) = \delta$. If the S.D.’s of the two
populations sampled are $\sigma$ and $\sigma'$, then the S.D. of $u_i$ will be $(\sigma^2 + \sigma'^2)^{1/2}$. The
consecutive $u$’s will be normal and independent and the problem of estimating the
difference between the means of two normal populations will be reduced to that
of estimating the mean of one population of the $u$’s. Its solution is given by the
confidence interval

$$\bar{u} - St_\alpha \le \delta \le \bar{u} + St_\alpha, \tag{15}$$

where $S$ has an obvious meaning and $t_\alpha$ is to be taken with $n - 1$ degrees of
freedom.

* Apart from these, the same author has obtained certain relevant results referring to the case
where $n = n' = 2$ (Bartlett, 1936).

Again, an experiment consisting in repeated sampling of pairs of normal
populations will show that, whatever be $\theta$, $\delta$, $\sigma$, $\sigma'$, whether constant or varying in
an absolutely arbitrary manner, the relative frequency of cases in which the
statement about $\delta$ in the form of (15) will be true will be approximately equal to $\alpha$.
The above solution of the problem, elegant as it is, is only a partial one. The results
[...] found by him exhausts all the possibilities and whether it is possible to construct
intervals which would be, in one sense or another, shorter than those given by
(15). These are interesting and important problems and we may hope to have
them solved.
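The repeated-sampling experiment described for the interval (15) can be sketched as follows. Several details are assumptions made for the illustration only: $n = 8$ and $n' = 12$, a confidence coefficient of 0.95 (tabulated $t_\alpha = 2.365$ for 7 degrees of freedom), the convention that the first series has expectation $\theta + \delta$, and the use of a random draw of $n$ members to randomize the longer series.

```python
import random
from math import sqrt

T_ALPHA = 2.365  # tabulated two-sided 5% point of Student's t, 7 degrees of freedom

def difference_interval(first, second, t_alpha, rng):
    """Interval (15) from n differences; `second` may be longer than `first`."""
    n = len(first)
    partners = rng.sample(second, n)  # randomize the second series, keeping n members
    u = [a - b for a, b in zip(first, partners)]
    u_bar = sum(u) / n
    s = sqrt(sum((ui - u_bar) ** 2 for ui in u) / (n * (n - 1)))
    return u_bar - t_alpha * s, u_bar + t_alpha * s

def coverage_frequency(trials=20_000, n=8, n_prime=12, seed=1941):
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # All four parameters are varied arbitrarily from one pair of populations to the next.
        theta, delta = rng.uniform(-10, 10), rng.uniform(-5, 5)
        sigma, sigma_prime = rng.uniform(0.1, 20), rng.uniform(0.1, 20)
        first = [rng.gauss(theta + delta, sigma) for _ in range(n)]
        second = [rng.gauss(theta, sigma_prime) for _ in range(n_prime)]
        low, high = difference_interval(first, second, T_ALPHA, rng)
        hits += low <= delta <= high
    return hits / trials

print(f"relative frequency of true statements: {coverage_frequency():.3f}")
```

Only $n$ of the $n'$ observations in the longer series enter the interval, which is one visible sense in which the solution is partial.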


A result in fiducial theory corresponding to, but not equivalent with, formula
(15) has been published by Fisher (1936).

Let us suppose that a sample of $n$ observations has yielded a mean, $\bar{x}$, and an estimated
variance of the mean, $s^2$, so that $s^2 = \Sigma(x_i - \bar{x})^2/n(n-1)$; then we know that if $\mu$ is the mean
of the population

$$\mu = \bar{x} + st, \tag{16}$$

where $t$ is distributed in ‘Student’s’ distribution. [...]

$$\mu' = \bar{x}' + s't', \tag{17}$$

where $t'$ is distributed in ‘Student’s’ distribution with $n' - 1$ degrees of freedom,
independently of $t$. If now

$$\mu' - \mu = \delta, \quad \bar{x}' - \bar{x} = d, \tag{18}$$

we find that

$$\epsilon = \delta - d = s't' - st, \tag{19}$$

and since $s'$ and $s$ are known, the quantity represented on the right has a known
distribution, though not one which has been fully tabulated. The equation may be written

$$\epsilon = \sqrt{(s^2 + s'^2)}\,(t' \cos R - t \sin R), \tag{20}$$

where $\tan R = s/s'$, so that $R$ is a known angle. If $t$ and $t'$ be taken as the co-ordinates of a
point on a plane, the frequency of the observations falling within any area of the plane is
calculable. The points for which $\epsilon$ has any given value lie on a straight line, at a distance
from the origin $\epsilon/(s^2 + s'^2)^{1/2}$, and making an angle $R$ with the axis of $t$. The fiducial
probability that $\epsilon$ exceeds any given value is the frequency in the area above this line. If $n$
and $n'$ are both increased, the distribution of $\epsilon$ tends to be normal and independent of $R$;
when $R$ is 0° or 90° the distribution is of ‘Student’s’ form. In general it involves $n$, $n'$, and
$R$ and for any chosen probability, therefore, requires a table of triple entry.

As the reader will notice, no restrictions are mentioned and it is not suggested
that for the practical application of the results any assumption is needed
concerning the variability of the variances of the populations sampled. Neither is
there any suggestion of any new principle that may be involved. We will return
to this point below.

Following the publication of Fisher just quoted, and on his advice, Sukhatme
published a table (Sukhatme, 1938). The quantity tabled may be denoted by
$f(n, n', R)$ and represents the root of the equation

$$\int_{-\infty}^{\infty} G(t) \int_{t'_0}^{\infty} H(t')\, dt'\, dt = 0.025, \tag{21}$$

where $G(t)$ and $H(t')$ are ‘Student’s’ distributions with $n - 1$ and $n' - 1$ degrees of
freedom respectively, while

$$t'_0 = \frac{f(n, n', R)}{\cos R} + t \tan R. \tag{22}$$



It follows from the context that $f(n, n', R)$ so calculated is the value such that
the fiducial probability of its being exceeded by $|\epsilon|/(s^2 + s'^2)^{1/2}$ is equal to 0.05.
In other words, the values $f(n, n', R)$ are the fiducial 5% limits of $|\epsilon|/(s^2 + s'^2)^{1/2}$.
As $\epsilon = \delta - d$, if the presumption that the fiducial limits necessarily lead to
confidence intervals be true then this means that the double inequality

$$\bar{x}' - \bar{x} - f(n, n', R)\sqrt{(s^2 + s'^2)} \le \delta \le \bar{x}' - \bar{x} + f(n, n', R)\sqrt{(s^2 + s'^2)} \tag{23}$$

must be the confidence intervals for $\delta = \mu' - \mu$. But it is easy to see that the
functions on the extreme parts of (23) do not satisfy the conditions, explained in
§3 above, necessary and sufficient for them to be the confidence limits. Take
$\delta = 0$ and denote simply by $A$ the region in the space of the $x$’s including all the
points in which the inequality (23) is satisfied. Take the probability law of the
$x$’s and put $\delta = 0$ in it, that is, $\mu' = \mu$. It will be seen that the integral $I(A)$ of this
probability law taken over $A$ depends on the ratio $\rho = \sigma/\sigma'$ of the two $\sigma$’s
appropriate to the two populations sampled and, thus, that it does not satisfy the
identity (5).
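Before following the analytical argument, the dependence of $I(A)$ on $\rho$ can be made plausible by a rough sampling experiment. Everything specific in the sketch below is an assumption for illustration: $n = n' = 7$, the three values of $\rho$, and above all the function `tabulate_f`, which is a Monte Carlo stand-in for Sukhatme's table and not his published values. The means and variances are drawn directly from their sampling distributions rather than from individual observations.

```python
import random
from math import atan2, cos, pi, sin, sqrt

def student_t(rng, df):
    """One draw from Student's t: a normal deviate over the root of chi-square/df."""
    return rng.gauss(0.0, 1.0) / sqrt(rng.gammavariate(df / 2, 2.0) / df)

def tabulate_f(n, n_prime, rng, grid_points=19, draws=40_000):
    """For R on a grid over [0, pi/2], the value exceeded by |t' cos R - t sin R|
    with probability 0.05 (simulated approximation to f(n, n', R))."""
    pairs = [(student_t(rng, n - 1), student_t(rng, n_prime - 1)) for _ in range(draws)]
    step = (pi / 2) / (grid_points - 1)
    table = []
    for k in range(grid_points):
        r = k * step
        values = sorted(abs(tp * cos(r) - t * sin(r)) for t, tp in pairs)
        table.append(values[int(0.95 * draws)])
    return step, table

def lookup_f(r, step, table):
    """Linear interpolation in the simulated table."""
    k = min(int(r / step), len(table) - 2)
    w = r / step - k
    return (1 - w) * table[k] + w * table[k + 1]

def coverage(rho, n, n_prime, step, table, rng, trials=40_000):
    """Relative frequency with which (23) is satisfied when delta = 0 and sigma/sigma' = rho."""
    sigma, sigma_prime = rho, 1.0
    hits = 0
    for _ in range(trials):
        mean = rng.gauss(0.0, sigma / sqrt(n))
        mean_prime = rng.gauss(0.0, sigma_prime / sqrt(n_prime))
        # s^2 and s'^2 are the estimated variances of the two means
        s2 = sigma ** 2 / n * rng.gammavariate((n - 1) / 2, 2.0) / (n - 1)
        s2_prime = sigma_prime ** 2 / n_prime * rng.gammavariate((n_prime - 1) / 2, 2.0) / (n_prime - 1)
        r = atan2(sqrt(s2), sqrt(s2_prime))  # tan R = s / s'
        hits += abs(mean_prime - mean) <= lookup_f(r, step, table) * sqrt(s2 + s2_prime)
    return hits / trials

rng = random.Random(1941)
N, N_PRIME = 7, 7
step, table = tabulate_f(N, N_PRIME, rng)
results = {rho: coverage(rho, N, N_PRIME, step, table, rng) for rho in (0.01, 1.0, 100.0)}
for rho, freq in results.items():
    print(f"rho = {rho}: relative frequency = {freq:.3f}")
```

Under these assumptions one would expect the frequency to lie near 0.95 when $\rho$ is very small or very large and to be noticeably higher near $\rho = 1$. A simulation of this kind only suggests the dependence on $\rho$; the integration carried out in the text establishes it.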

Condition (23) defining the region A does not involve the particular x’s but
only the means x̄, x̄’, and the variances s² and s’². Consequently, to calculate
I(A) we may start with the probability law of those four variables

$$p(\bar{x},\bar{x}',s,s') = \frac{c\,s^{n-2}s'^{\,n'-2}}{\sigma^{n}\sigma'^{\,n'}}\exp\left[-\frac{n(\bar{x}-\mu)^2}{2\sigma^2}-\frac{n'(\bar{x}'-\mu')^2}{2\sigma'^2}-\frac{n(n-1)s^2}{2\sigma^2}-\frac{n'(n'-1)s'^2}{2\sigma'^2}\right], \qquad (24)$$





where c is a purely numerical constant and does not involve any of the parameters. 
This function must be integrated over the region A defined by (23) or by the 
equivalent inequality 

$$\frac{|\bar{x}'-\bar{x}|}{(s^2+s'^2)^{1/2}} < f(n,n',R). \qquad (25)$$

In dealing with it, we have to remember that R is not a constant but is connected
with s and s’ by the equation tan R = s/s’. The required integral, or probability,
of x̄, x̄’, s, and s’ satisfying (25) will be more easily calculated if we introduce a new
system of variables, u, v, R, and s₀. These will be connected to the old system as
follows:

$$\left.\begin{aligned} \bar{x} &= \mu + u s_0 \sin R, & s &= s_0 \sin R,\\ \bar{x}' &= \mu' + v s_0 \cos R, & s' &= s_0 \cos R. \end{aligned}\right\} \qquad (26)$$


The Jacobian J of the transformation is easily found to be

$$J = s_0^3 \sin R \cos R. \qquad (27)$$
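
The value of J in (27) can be checked numerically. The Python sketch below is an illustration added for the reader and is not part of the original derivation. It assumes that the transformation (26) reads x̄ = μ + u s₀ sin R, x̄’ = μ’ + v s₀ cos R, s = s₀ sin R, s’ = s₀ cos R; the function names, the test point and the step size are hypothetical choices made for the example.

```python
import math

def transform(u, v, R, s0, mu=0.0, mu_p=0.0):
    """Assumed form of (26): maps (u, v, R, s0) to (x_bar, x_bar', s, s')."""
    return (mu + u * s0 * math.sin(R),
            mu_p + v * s0 * math.cos(R),
            s0 * math.sin(R),
            s0 * math.cos(R))

def det(m):
    """Determinant by cofactor expansion along the first row."""
    if len(m) == 1:
        return m[0][0]
    total = 0.0
    for j, a in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * a * det(minor)
    return total

def numerical_jacobian(point, h=1e-6):
    """Absolute value of the Jacobian determinant from central differences."""
    rows = []
    for k in range(4):
        up, down = list(point), list(point)
        up[k] += h
        down[k] -= h
        f_up, f_down = transform(*up), transform(*down)
        # Row k holds the partial derivatives with respect to the k-th new variable.
        rows.append([(a - b) / (2 * h) for a, b in zip(f_up, f_down)])
    return abs(det(rows))

point = (0.3, -1.2, 0.7, 1.5)             # arbitrary (u, v, R, s0)
closed_form = point[3] ** 3 * math.sin(point[2]) * math.cos(point[2])
print(numerical_jacobian(point), closed_form)
```

The two printed numbers should agree to several decimal places at any admissible point, which is all that the check is meant to show.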


J. NEYMAN 141 
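The limits of variation of the new variables are as follows:

$$-\infty < u, v < +\infty, \qquad 0 < s_0, \qquad 0 < R < \tfrac{1}{2}\pi. \qquad (28)$$

The probability law of the new variables will be

$$p(u,v,s_0,R) = \frac{c}{\sigma^{n}\sigma'^{\,n'}}\, s_0^{\,n+n'-1} e^{-\frac{1}{2}s_0^2\chi^2} \sin^{n-1}R\,\cos^{n'-1}R, \qquad (29)$$

with

$$\chi^2 = \frac{n u^2 \sin^2 R}{\sigma^2} + \frac{n' v^2 \cos^2 R}{\sigma'^2} + \frac{n(n-1)\sin^2 R}{\sigma^2} + \frac{n'(n'-1)\cos^2 R}{\sigma'^2}. \qquad (30)$$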

Inequality (25) will be equivalent to

$$|v\cos R - u\sin R| < f(n,n',R). \qquad (31)$$

As this does not involve s₀, the integration with respect to this variable can be
carried out within the extreme limits of its variation. As a result further integrations
may be performed on the probability law of u, v, R,

$$p(u,v,R) = \int_0^{\infty} p(u,v,s_0,R)\,ds_0 = \frac{c\,\sin^{n-1}R\,\cos^{n'-1}R}{\sigma^{n}\sigma'^{\,n'}\chi^{\,n+n'}}, \qquad (32)$$





where c is again a numerical constant. 
Further integration may be conveniently carried out as follows. Substitute
a new variable z for the variable v so that

$$v = \frac{z + u\sin R}{\cos R}, \qquad \frac{\partial v}{\partial z} = \frac{1}{\cos R}. \qquad (33)$$


Keep z constant within the limits |z| < f(n,n’, R) prescribed by (31) and integrate
for u from −∞ to +∞. The result is

$$p(z,R) = \frac{c\,\sin^{n-2}R\,\cos^{n'-2}R}{\sigma^{n-1}\sigma'^{\,n'-1}(n\sigma'^2+n'\sigma^2)^{1/2}} \left[\frac{nn'}{n\sigma'^2+n'\sigma^2}\,z^2 + \frac{n(n-1)}{\sigma^2}\sin^2 R + \frac{n'(n'-1)}{\sigma'^2}\cos^2 R\right]^{-\frac{1}{2}(n+n'-1)}. \qquad (34)$$


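The integration is completed by an easy substitution for z

$$I(A) = c\rho^{\,n'-1}\int_0^{\pi/2}\left[\frac{\sin^{n-2}R\,\cos^{n'-2}R}{\{n(n-1)\sin^2 R + n'(n'-1)\rho^2\cos^2 R\}^{\frac{1}{2}(n+n'-2)}}\int_0^{Z}\frac{dz}{(1+z^2)^{\frac{1}{2}(n+n'-1)}}\right]dR, \qquad (35)$$

where the upper limit Z is the value into which the substitution carries the limit f(n,n’, R) of (31),

$$Z = f(n,n',R)\left[\frac{nn'\rho^2}{(n+n'\rho^2)\{n(n-1)\sin^2 R + n'(n'-1)\rho^2\cos^2 R\}}\right]^{1/2}.$$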













[…] Statistical Laboratory […] author’s indebtedness to her. The calculations involved […]

[…] more or less evident that I(A) must depend on ρ […] any doubt […] values of […] Scott, University of California […] it is a pleasure to record […] denser set of values of R. The calculated values of I(A) […] original deduction […] conclusion that the presumption of intrinsic identity […]

n = 12, n’ = […]

ρ       I(A)
[…]     […]
1.0     0.960
10.0    0.934

[…] cases where the prediction of the value of δ by means […] correct need not be equal to the expected 0.95. It will […] just quoted, and […] subsequent elaborations by Fisher and Yates amount to […] values of f(n,n’, R) as tabled by Sukhatme do not provide […] both authors are emphatic that there is no error in […] and that Bartlett misunderstood the problem. It is […] unanimous papers are mistaken and, therefore, we […] between […] limits is unfounded […]





was published, there was much to be said in favour of the opinion that the solution 
of Fisher, as quoted above, and the work of Sukhatme both involved errors in the 
algebra of probability laws. It also seems that, apart from establishing that the 
fiducial theory and the theory of confidence intervals are distinct, it will be of 
some interest to analyse Fisher’s work in detail and to point out exactly where 
and how it diverges from the rules of ordinary theory of probability on which the 
theory of confidence intervals is based. 

When a system of observable phenomena is treated mathematically, it is 
essential to be clear on exactly what is assumed as given or as known. For example,
when trying to calculate the area of land from a certain set of measurements, it is 
essential to be clear as to assumptions made concerning the shape of the land 
considered. The available data may be consistent with a number of such assump- 
tions, e.g. that the surface considered is a plane or that it is spherical with a given 
radius, etc. Whichever of these hypotheses is accepted as given, the applications 
of the appropriate formulae will give mutually consistent results. But they would
not generally be consistent if one part of the calculations were made on one 
hypothesis and another on a contradictory one. The differences may be small, but 
in mathematics there are really no ‘small’ nor ‘large’ inconsistencies. There are 
simply inconsistencies. Needless to say, the choice of exactly what is to be 
accepted as given must be made to attain the greatest conformity with empirical 
facts. But this is a question which need not be discussed here. 

The above general principle also applies to the applications of probability. 
There we must be clear as to exactly what are the phenomena or the variables 
which we agree to consider as random in a given inquiry. In practice, of course, 
the random variable will be the one whose value at the moment is uncertain and 
is being determined ‘by chance’. If X is considered as a random variable, the 
premises of the mathematical problem must include some assumptions as to the 
relative frequencies with which X assumes its possible values. These assumptions 
may vary in specificity, but they must be present in the premises. 

Any number or variable which is not random must be clearly recognized as 
such. For some time such non-random numbers were called constants. This was 
more or less satisfactory with constant numbers. But Fréchet (Fréchet, 1937) has 
noticed that we may also consider variables which are not random and has in- 
vented useful terms to describe them. These are ‘nombre certain’, ‘fonction 
certaine’, etc. We will translate these terms by ‘sure number’ and ‘sure function’. 
The thousandth digit in the expansion π = 3.1415... is a sure number, although
totally unknown to me. Denote by f(n) the relative frequency of 0’s among the
first n digits of the same expansion of π. This will be a sure function. On the other
hand, if φ(n) denotes the number of errors that may be made when calculating π
to n places of decimals, then φ(n) may be considered as a random function of n.
Considerations of this kind would imply those of a considerable sequence S of
similar attempts to calculate π, by the same person or by different persons of a
specified category, in which the values of φ(n) will vary, as we shall say, at random.
It is with respect to just such a sequence of determinations of the values of the
function φ(n) that our probability statements will refer. For example, if we either
start or finish our calculations with the probability equal to 0.25 of φ(n) being
between any two sure numbers a, b, then the applicational statement is that about
25% of the numbers of the sequence S satisfy the inequality a < φ(n) < b.
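
The notion of a sure function can be made concrete with a few lines of Python. This sketch is an added illustration, not part of the lecture of which this is a discussion: it evaluates f(n) for n up to 50, under the assumption that ‘the first n digits’ means the first n places of decimals (the leading 3 is left out). The constant and function names are hypothetical.

```python
# First 50 decimal places of pi (the leading "3." is omitted by assumption).
PI_DECIMALS = "14159265358979323846264338327950288419716939937510"

def f(n):
    """Relative frequency of the digit 0 among the first n decimals of pi."""
    if not 1 <= n <= len(PI_DECIMALS):
        raise ValueError("n must lie between 1 and 50 in this sketch")
    return PI_DECIMALS[:n].count("0") / n

for n in (10, 32, 50):
    print(n, f(n))
```

However often f(32) is evaluated, the answer is the same, because nothing in its definition refers to a sequence of trials. A function such as φ(n), the number of errors made by a computer of π, would by contrast require a specification of the sequence S before any probability could be attached to it.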

It is important to notice that the sequence S may consist of just one member; 
then all the proportions relating to this ‘sequence’ will have to be either 0 or 1. In
other words, if the sequence of ‘random’ determinations consists of just one 
element, this element will have the property of a sure, not a random, object, in the 
usual sense of the word. 

Now let us turn to the passage from Fisher’s paper quoted above, p. 136, and
try to see exactly what is supposed to be random there and what elements of the
problem are treated as sure numbers or sure functions. These details in the set-up
are not stated at the outset, but there is no difficulty in collecting them from
appropriate passages in the paper. We first see that the function t of (10) is supposed
to be ‘distributed in different samples...’. This means that t is a random
variable and that its randomness depends on what is found in those repeated
samples, namely, the values of x̄ and s. It follows that the probabilities concerning
x̄, s, and t refer to the sequence S of those ‘different’ samples. The sequence could
not consist of just one sample because, in such a case, the ‘distribution’ of t would
not be anything like ‘Student’s’ law. The references to a normal population
sampled and to ‘Student’s’ law indicate, on the contrary, that the sequence S
of samples is very large indeed, and that the distributions in it are comparable to
those represented by continuous curves.

Up to this time we have not mentioned the population mean μ which is also
involved in the expression of t. Obviously, this may be treated mathematically
either as a random or as a sure number. Both methods of approach are at our
disposal but, in order to avoid inconsistencies, we must be clear as to which one
we follow. The indication of Fisher’s choice is found a little further on in this
article, in the place describing the distinction between the fiducial and the inverse
probability approach: ‘It is of some importance to distinguish such (fiducial)
probability statement about the value of μ, from those that would be derived by
the method of inverse probability from any postulated knowledge of the distribution
of μ in the different populations which might have been sampled.’ This
sentence does not seem to leave any ground for doubt. In the fiducial approach
we consider but one population sampled and no distribution of μ is postulated.
Therefore, μ is a sure number and, if t is distributed according to ‘Student’s’ law,
it is a result of the appropriate variability of x̄ and s alone.

The symbol t₁, which also comes into play, is obviously a sure variable capable
of any real value between −∞ and +∞. We may select it as we wish and then
obtain the probability P(t₁) of the random variable t exceeding t₁ from tables.



Following the article, we will readily agree with Fisher that the inequality (11),
namely, μ < x̄ − st₁/√n, is equivalent to t > t₁ and that it must be satisfied with some
probability P(t₁). Now consider the phrase: ‘Since, therefore, the right-hand side
of the inequality (i.e. x̄ − st₁/√n) takes, by varying t₁, all real values, we may state
the probability that μ is less than any assigned value, or the probability that it lies
between any assigned values, or, in short, its probability distribution in the light
of the sample observed.’ From the point of view of ordinary logic and of ordinary
theory of probability this phrase is inconsistent with the original set-up. The first
inconsistency is involved in the words which are italicized, suggesting that x̄ and
s in the expression x̄ − st₁/√n are not random but sure numbers, referring to one
particular observed sample. As a matter of fact this same inconsistency appears
earlier in the statement that x̄ − st₁/√n, by varying t₁, will run through all real
numbers. If, as formerly, x̄ and s are random with their variation appropriate to
the sequence S, then, whatever value we choose to ascribe to t₁, say t₁ = 2, the
expression x̄ − 2s/√n is also random and depends on the outcome of sampling.

Apart from this sudden shift in the meaning ascribed to x̄ and s, there are two
more inconsistencies. To see the first of them, let us follow Fisher, changing our
minds about x̄ and s and considering them as sure numbers, determined by one
particular sample. In this case the inequality μ < x̄ − st₁/√n would contain no
random elements at all: the first element, μ, is an unknown constant, the mean of
a single population sampled, x̄ and s are fixed by the sample observed, and t₁
is the value of the sure variable that we have chosen to consider. In these circumstances,
the inequality may either be true or not true and the probability of its
being true will equal unity or zero and have nothing to do with the probability
or frequency P(t₁) which this same inequality satisfies within a sequence S of many
‘different’ samples.
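
The contrast drawn here, between a frequency within a long sequence S and a statement about one fixed sample, can be illustrated by simulation. The Python sketch below is an added illustration under stated assumptions: the population is taken to be normal with μ = 0 and σ = 1, n = 4, t₁ = 0.765, and the value 9.5 used for μ in the single-sample case is purely hypothetical. None of these choices comes from the paper except n and t₁, which anticipate the numerical example given in the text.

```python
import math
import random

def sample_mean_sd(rng, n, mu, sigma):
    """Mean and estimated standard deviation (divisor n - 1) of one sample."""
    xs = [rng.gauss(mu, sigma) for _ in range(n)]
    m = sum(xs) / n
    s = math.sqrt(sum((x - m) ** 2 for x in xs) / (n - 1))
    return m, s

def frequency_of_inequality(t1, n=4, mu=0.0, sigma=1.0, trials=50_000, seed=0):
    """Share of samples in the sequence S for which mu < x_bar - s*t1/sqrt(n)."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        m, s = sample_mean_sd(rng, n, mu, sigma)
        if mu < m - s * t1 / math.sqrt(n):
            hits += 1
    return hits / trials

# Many 'different' samples: the inequality holds with a stable frequency.
freq = frequency_of_inequality(0.765)
print("frequency within the sequence S:", freq)

# One fixed sample and one fixed (hypothetical) mu: the inequality is
# simply true or false, and no frequency other than 0 or 1 applies.
x_bar, s, mu_fixed = 10.0, 2.0, 9.5
single = mu_fixed < x_bar - s * 0.765 / math.sqrt(4)
print("single sample:", single)
```

The first number settles near P(t₁) as the number of trials grows, while the second result is a plain truth value, which is the distinction the paragraph insists upon.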

The last inconsistency refers, of course, to the point of view on μ. As we have
seen above, it is first considered as a sure number, but the passage just quoted
speaks of the probability of its lying between any assigned limits possible to
determine from the values of P(t₁). Assume n = 4 and that the sample observed
gives x̄ = 10 and s = 2. Select t₁ = 0.765 and t₂ = −0.765 so that P(t₁) = 0.25 and
P(t₂) = 0.75. This would result in the supposed probability P’ of μ lying between
the limits 9.235 < μ < 10.765, being equal to 1/2. Trying to interpret this result in
the light of the classical theory of probability, we have to conceive a sequence,
say S’, of cases in 50% of which μ falls between the above limits. But exactly what
could this sequence be? Either there is such a sequence and then we must also
consider other populations ‘which might have been sampled’, and postulate something
about the distribution of μ*, or else the ‘sequence’ must be the degenerate one
of one element only with the probability P’ equal to either zero or unity, but
never to 1/2.
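
The arithmetic of this example is easily reproduced. The following Python sketch is an added illustration: it uses the closed form of ‘Student’s’ distribution function for 3 degrees of freedom, which is a standard result and not something stated in the paper, to confirm that P(0.765) is close to 0.25, and then forms the limits x̄ ± st₁/√n. The function names are hypothetical.

```python
import math

def student3_upper_tail(t):
    """P(T > t) for 'Student's' distribution with 3 degrees of freedom."""
    u = t / math.sqrt(3.0)
    cdf = 0.5 + (math.atan(u) + u / (1.0 + u * u)) / math.pi
    return 1.0 - cdf

def limits(x_bar, s, n, t1):
    """The pair x_bar -/+ s*t1/sqrt(n) used in the text."""
    half_width = s * t1 / math.sqrt(n)
    return x_bar - half_width, x_bar + half_width

p1 = student3_upper_tail(0.765)    # P(t1)
p2 = student3_upper_tail(-0.765)   # P(t2)
lower, upper = limits(10.0, 2.0, 4, 0.765)
print(p1, p2, lower, upper)
```

The difference P(t₂) − P(t₁) is the supposed probability P’ of the text. The calculation itself is uncontroversial; the dispute concerns what sequence of cases, if any, the resulting number can refer to.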

These are the points previously mentioned by the author (Neyman, 1934), 


* This is quite essential. Otherwise there would be an error in Bayes’s theorem. 













which, from the point of view of classical probability, represent conceptual in- 
consistencies. They are also present in the other passage of Fisher quoted on 
p. 139, but a similar analysis of that passage supplemented by what has subse- 
quently been done by Sukhatme, will reveal errors in algebra of probability laws 
as well. These errors are particularly relevant from the point of view of the contro- 
versies between Bartlett and Fisher. 

The quantities considered in this passage are all dependent on the population
means μ and μ’ and on the statistics x̄ and s of one random sample and on x̄’ and
s’ of the other. Our analysis will also require the consideration of the population
variances σ² and σ’². We must start by deciding on the random or sure character
of all these quantities. Fisher’s remark that the two ratios

$$t = \frac{\bar{x}-\mu}{s} \quad\text{and}\quad t' = \frac{\bar{x}'-\mu'}{s'} \qquad (37)$$

are distributed according to ‘Student’s’ law with appropriate degrees of freedom
suggests that μ and μ’ are treated as sure numbers and that x̄, x̄’, s, and s’ are
random. There is no reference whatever to the variances σ² and σ’². As nothing
is disclosed about what distribution they may possess, by analogy with the μ’s
it is natural to treat them as sure numbers also.

In order to interpret every step in calculations more easily, we shall imagine
two normal populations π₁ and π₂ sampled and a sequence A of pairs of samples,
of n and n’ individuals respectively, drawn independently from π₁ and π₂. These
pairs of samples will determine x̄, s, x̄’, and s’, generating distributions appropriate
to normal populations. Substituted into formulae (37) they will make t and
t’ vary to generate the two distributions of ‘Student’.


With this in mind, let us examine the passage in which Fisher writes

$$\epsilon = \delta - d = s't' - st \qquad (38)$$

and comments: ‘Since s’ and s are known, the quantity represented on the right
has a known distribution, though not one which has been fully tabulated.’ We
see here the same kind of sudden jump in the point of view on quantities considered
as is found in the passage analysed previously. Formerly s’ and s were not
‘known’ but random. Otherwise, the distributions of t and t’ would not have been
those of ‘Student’ but would have been normal about zero and due solely to the
variability of x̄ and x̄’. Now s’ and s are known sure numbers. Let us allow for this
shift in conditions and try to visualize the character of the distribution of ε for
fixed s’ and s. For this purpose we have to consider not the whole sequence A of pairs
of samples mentioned above, but only a subsequence B composed only of those
pairs of samples in which the estimated variances have the same values s and s’ as
the ones supposed to be ‘known’. The variability of ε in the subsequence B will be
the result of the variability of x̄ and x̄’ only. It is known that the mean of a sample
from a normal population is independent of the sample variance. Consequently


[…] will be normal […] t and R in order to obtain the probability that z exceeds […] any given […]   (40)

The relative probability, given R, of z exceeding a fixed number z₁, that is
P{z > z₁ | R}, will be obtained by integrating (40) for z from z₁ to +∞. There is an
alternative way of obtaining the same probability. This consists of first finding the 


relative joint probability law given R of t and t’. If this is denoted by p(t, t’ | R)
then

$$P\{z > z_1 \mid R\} = \iint_{w(z_1)} p(t,t' \mid R)\,dt\,dt', \qquad (41)$$











where the region of integration w(z₁) is determined by the inequality

$$z = t'\cos R - t\sin R > z_1. \qquad (42)$$

A familiar formula gives

$$p(t,t' \mid R) = \frac{p(t,t',R)}{p(R)}. \qquad (43)$$

Whichever way, (40) or (43), is preferred, the resulting probability P{z > z₁ | R}
will have the same value and will refer to the sequence C described above.

Sukhatme has chosen to apply a quadrature procedure to calculate the integral
(41) with the integrand equal to the product of two of ‘Student’s’ distributions with
n − 1 and n’ − 1 degrees of freedom respectively. This is just the error in algebra of
probability laws mentioned above. The t and t’ are distributed independently and
in accordance with ‘Student’s’ laws only in the sequence A where both the means
x̄ and x̄’ and also the variances s² and s’² are undisturbed in their random and
independent variation appropriate to samples from normal populations. When
calculating the probability ‘for a given R’, we do not consider the sequence A
but only its part C so selected that the ratio s/s’ is constant. This selection disturbs
the original distribution of s and s’ and is reflected in the resulting joint distribution
of t and t’.
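
The effect of selecting the part C of the sequence A can be seen in a small simulation. The Python sketch below is an added illustration and makes several assumptions that are not in the paper: n = n’ = 6, a band of ±0.05 radians about R = π/4 standing in for ‘a given R’, the cut-off |t| > 1, and two trial values of ρ = σ/σ’. It also assumes, as the form of (24) suggests, that s and s’ are the estimated standard errors of the two means. All names are hypothetical.

```python
import math
import random

def simulate_pairs(rho, n=6, n_prime=6, trials=60_000, seed=1):
    """Return (t, t', R) for pairs of normal samples with sigma/sigma' = rho."""
    rng = random.Random(seed)
    sigma, sigma_p = rho, 1.0
    out = []
    for _ in range(trials):
        # x_bar - mu is normal with variance sigma^2/n, and
        # n(n-1)s^2/sigma^2 follows chi-square with n-1 degrees of freedom.
        dx = rng.gauss(0.0, sigma / math.sqrt(n))
        dxp = rng.gauss(0.0, sigma_p / math.sqrt(n_prime))
        chi2 = rng.gammavariate((n - 1) / 2, 2.0)
        chi2_p = rng.gammavariate((n_prime - 1) / 2, 2.0)
        s = sigma * math.sqrt(chi2 / (n * (n - 1)))
        sp = sigma_p * math.sqrt(chi2_p / (n_prime * (n_prime - 1)))
        out.append((dx / s, dxp / sp, math.atan2(s, sp)))  # tan R = s/s'
    return out

def tail_fraction(triples, band=None, cut=1.0):
    """Share of cases with |t| > cut, optionally restricted to R in band."""
    chosen = [t for t, _, r in triples if band is None or band[0] < r < band[1]]
    return sum(abs(t) > cut for t in chosen) / len(chosen)

band = (math.pi / 4 - 0.05, math.pi / 4 + 0.05)
pairs_small = simulate_pairs(rho=0.5)
pairs_large = simulate_pairs(rho=2.0)

all_small = tail_fraction(pairs_small)          # whole sequence A
all_large = tail_fraction(pairs_large)
cond_small = tail_fraction(pairs_small, band)   # selected part C
cond_large = tail_fraction(pairs_large, band)
print(all_small, all_large, cond_small, cond_large)
```

Within the whole sequence A the share of cases with |t| > 1 does not depend on ρ, as ‘Student’s’ law requires. Within the selected part the two shares differ markedly, so that the distribution of t ‘for a given R’ cannot be ‘Student’s’ distribution with n − 1 degrees of freedom for every ρ.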

In our calculations above (26) we have used the letters u and v for what is here
denoted by t and t’. Consequently, the joint probability law p(t, t’, R) is obtained
from (32) by merely substituting t for u and t’ for v. The absolute probability law
of R is easily obtained by integrating (34) with respect to z between the limits
−∞ and +∞. The result is

$$p(R) = \frac{c\rho^{\,n'-1}\sin^{n-2}R\,\cos^{n'-2}R}{\{n(n-1)\sin^2 R + n'(n'-1)\rho^2\cos^2 R\}^{\frac{1}{2}(n+n'-2)}}, \qquad (44)$$





with c denoting a numerical constant. Substituting (32) and (44) into (43) we
obtain

$$p(t,t' \mid R) = \frac{\phi(R,\rho)}{\{n(t^2+n-1)\sin^2 R + n'(t'^2+n'-1)\rho^2\cos^2 R\}^{\frac{1}{2}(n+n')}}, \qquad (45)$$

with φ(R, ρ) denoting a function of R, ρ, n and n’ only. p(t, t’ | R) is just the function
to be integrated to obtain the relative probability given R of t and t’ to verify
any inequality such as t’ cos R − t sin R > z₁. As one would expect p(t, t’ | R)
appears to depend not only on R but also on the ratio of the population variances ρ².

It follows that, from the point of view of the ordinary theory of probability, 
the Fisher-Sukhatme solution is wrong. The error consists in their confusing the 
absolute probability law of t and ¢’, obtainable by integrating (32) for R, with the 
relative probability law given R of the same variables as given by (45). Some such 
error seems to have been suspected by Bartlett. Repeated denials and the reference
to the extra-logical principle underlying the fiducial theory lead us to
believe that from the point of view of that particular theory the error is non-existent.
While accepting these explanations we may still regret that the earlier
papers by Fisher and that of Sukhatme do not contain any clue as to how they
are to be interpreted.


6. SUMMARY


1. The theories of fiducial argument and of confidence intervals differ in their 
basic conceptions. The validity of the former requires, at least in some cases, the 
fulfilment of various restrictions of which the theory of confidence intervals is 
totally free, and/or the acceptance of some new principles impossible to deduce by 
the rules of ordinary logic (Yates, 1939; Fisher, 1939b).

2. The two theories may occasionally give the same numerical results in the 
form of fiducial limits on one side and of confidence limits on the other. The pro- 
blem of estimating the difference of means of two unknown normal populations 
shows, however, that this need not always be the case and that fiducial limits need 
not satisfy the definition of confidence limits. 

3. Bartlett’s criticisms of Fisher’s solution of the problem just mentioned 
seem to be due to his considering the problem from the point of view of ordinary 
theory of probability and ordinary logic. In this light Fisher’s solution does 
contain both conceptual misunderstandings (originally pointed out in the author’s 
paper of 1934) inherent in the very concept of fiducial distribution of a parameter, 
and errors in algebra of probability laws. Since the first references to the new 
principles outside of ordinary logic, which supposedly justify the fiducial theory, 
were published after the publication of Bartlett’s criticisms, the latter seem to be 
perfectly justified and useful. 

4. Owing to a certain flaw in the ideas underlying the fiducial theory which is 
noticeable in passages quoted in §4, it is impossible to insist on any definite 
attitude towards it, except that of doubt. It may be useful, however, to express 
the following conjectures which seem to be very probable. If they are wrong then 
they will be put right and, as a result, the situation will be clarified. 

The present author is inclined to think that the literature on the theory of 
fiducial argument was born out of ideas similar to those underlying the theory of 
confidence intervals. These ideas, however, seem to have been too vague to 
crystallize into a mathematical theory. Instead they resulted in misconceptions 
of ‘fiducial probability’ and ‘fiducial distribution of a parameter’ which seem to
involve intrinsic inconsistencies as described in §5. In this light, the theory of 
fiducial inference is simply non-existent in the same sense as, for example, a theory 
of numbers defined by mutually contradictory definitions. 

In earlier stages when the problems treated were very simple, the fallacy 
involved in ‘fiducial probability’ was not apparent. Later on, however, diffi- 
culties appeared and the new principle ‘which cannot be deduced by logic’ seems 
to have been invented to disentangle them in one particular case. But the word 
‘principle’ implies some generality, hence the drift in comments on the same
















subjects treated in 1936 and again in 1939. From the point of view of the direction 
of this drift it is perhaps significant that Yates speaks of ‘fiducial statements’ 
possible to make on the ground of probabilities a posteriori and that the paper by 
Jeffreys which professes the equivalence of fiducial theory with that of inverse 
probability appeared in the Annals of Eugenics, edited by R. A. Fisher. 

However this may be, the only thing that the present author ventures to 
profess is that the theory of fiducial probability is distinct from that of confidence 
intervals. 


REFERENCES 


BARTLETT, M. S. (1936). Proc. Camb. Phil. Soc. 32, 560.
—— (1939). Ann. Math. Statist. 10, 129.
CLOPPER, C. J. & PEARSON, E. S. (1934). Biometrika, 26, 404.
DAVID, F. N. & NEYMAN, J. (1938). Statist. Res. Mem. 2, 105.
FELLER, W. (1938). Statist. Res. Mem. 2, 117.
FISHER, R. A. (1925-34). Statistical Methods for Research Workers. London: Oliver and
Boyd.
—— (1930). Proc. Camb. Phil. Soc. 26, 528.
—— (1933). Proc. Roy. Soc. A, 139, 343.
—— (1935). The Design of Experiments. London: Oliver and Boyd.
—— (1936). Ann. Eugen., Lond., 6, 391.
—— (1937). Ann. Eugen., Lond., 7, 370.
—— (1939a). Ann. Eugen., Lond., 9, 174.
—— (1939b). Ann. Math. Statist. 10, 383.
FRÉCHET, M. (1937). Recherches théoriques modernes sur la théorie des probabilités. Paris:
Gauthier-Villars.
JEFFREYS, H. (1939). Theory of Probability. Oxford: Clarendon Press.
—— (1940). Ann. Eugen., Lond., 10, 48.
MISES, RICHARD v. (1939). Probability, Statistics and Truth. London: W. Hodge and Co.
NEYMAN, J. (1934). J. R. Statist. Soc. 97, 558.
—— (1935a). Ann. Math. Statist. 6, 111.
—— (1935b). Bull. Soc. Math. Fr. 63, 246.
—— (1937). Philos. Trans. A, 236, 333.
—— (1938a). Lectures and Conferences on Mathematical Statistics. Graduate School,
U.S. Department of Agriculture, Washington, D.C.
—— (1938b). Actualités Sci. Industr. no. 739, p. 25.
NEYMAN, J. & PEARSON, E. S. (1933). Philos. Trans. A, 231, 289.
PEARSON, E. S. (1939). Biometrika, 30, 471.
PEARSON, KARL (1938). The Grammar of Science. London: Everyman’s Library.
PITMAN, E. J. G. (1939). Biometrika, 30, 391.
PYTKOWSKI, W. (1932). The Dependence of Income of Small Farms upon their Area, the
Outlay and the Capital Invested in Cows. Warsaw: Series Bibljoteka Pulawska, 34.
STARKEY, DAISY M. (1938). Ann. Math. Statist. 9, 201.
‘STUDENT’ (1908). Biometrika, 6, 1.
—— (1925). Metron, 5, 18.
SUKHATME, P. V. (1938). Sankhyā, 4, 39.
WALD, A. (1939). Ann. Math. Statist. 10, 299.
WALD, A. & WOLFOWITZ, J. (1939). Ann. Math. Statist. 10, 105.
WELCH, B. L. (1938). Biometrika, 29, 350.
—— (1939). Ann. Math. Statist. 10, 58.
YATES, F. (1939). Proc. Camb. Phil. Soc. 35, 579.







TABLES OF PERCENTAGE POINTS OF THE 
INCOMPLETE BETA-FUNCTION 


COMPUTED BY CATHERINE M. THOMPSON


CONTENTS 


Prefatory Note. By E. S. PEARSON
Description of the calculation. By L. J. COMRIE and H. O. HARTLEY
Methods of interpolation. By H. O. HARTLEY
Tables

PREFATORY NOTE 
By E. S. PEARSON 
THE Incomplete Beta Function ratio has been defined as

$$I_x(p,q) = B_x(p,q)/B(p,q) = \frac{\Gamma(p+q)}{\Gamma(p)\,\Gamma(q)}\int_0^x x^{p-1}(1-x)^{q-1}\,dx. \qquad (1)$$
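
For readers who wish to experiment with definition (1), the following Python sketch may be useful. It is an added illustration and has no connexion with the methods by which the present tables were computed: it evaluates the ratio by Simpson’s rule, under the assumption p ≥ 1 and q ≥ 1 so that the integrand stays bounded, and finds by bisection the value of x at which the ratio equals a chosen level. The function names and the number of steps are hypothetical choices.

```python
import math

def beta_ratio(x, p, q, steps=2000):
    """I_x(p, q) of definition (1) by composite Simpson's rule (p, q >= 1)."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    def integrand(t):
        return t ** (p - 1) * (1.0 - t) ** (q - 1)
    h = x / steps                      # steps must be even for Simpson's rule
    total = integrand(0.0) + integrand(x)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * integrand(i * h)
    norm = math.gamma(p + q) / (math.gamma(p) * math.gamma(q))
    return norm * total * h / 3.0

def percentage_point(level, p, q, iterations=60):
    """Value of x for which I_x(p, q) equals the given level, by bisection."""
    lo, hi = 0.0, 1.0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if beta_ratio(mid, p, q) < level:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

print(beta_ratio(0.5, 2.0, 2.0))
print(percentage_point(0.25, 2.0, 1.0))
```

Bisection is chosen here only because the ratio increases steadily with x, so that the root is bracketed from the start; it makes no claim to the five-figure accuracy of the tables.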


When the fundamental Tables of the Incomplete Beta-function(6) were published
by Karl Pearson in 1934 it was realized that they might form a basis for
shorter tables suited for use in special problems. One such application is in con- 
nexion with sampling theory and the associated significance tests. In carrying 
out these tests it is generally considered that a table giving values of the argument 
corresponding to certain convenient probability levels is more useful than one 
in which the probability integral is listed at equal intervals of the argument. 
Using the transformed variable

$$z = \tfrac{1}{2}\log_e\left\{\frac{\nu_2(1-x)}{\nu_1 x}\right\}, \qquad (2)$$

R. A. Fisher(3) was the first to provide tables of this character, giving values of
z associated with the 0.05 and 0.01 probability levels. Since then, a table for the
0.001 level has been calculated by Colcord & Deming(2) and one for the 0.20 level
by H. W. Norton (in Tables edited by Fisher & Yates(4)). In the terminology of
the analysis of variance, if
S₁ is a sum of squares depending on ν₁ degrees of freedom, and
S₂ is a sum of squares depending on ν₂ degrees of freedom,
then

$$x = \frac{S_2}{S_1 + S_2}, \qquad (3)$$

and

$$\nu_1 = 2q, \qquad \nu_2 = 2p. \qquad (4)$$











152 Percentage points of the incomplete beta-function 


For tests of significance, z is easily computed and the fact that, when ν₁ and
ν₂ are large, it tends to be normally distributed about zero with a standard
deviation

$$\sqrt{\left\{\tfrac{1}{2}\left(\frac{1}{\nu_1}+\frac{1}{\nu_2}\right)\right\}}, \qquad (5)$$

lent considerable weight to its tabulation rather than that of x. Experience has,
however, shown that in a number of problems it is the percentage levels of x of
the Beta-distribution that are directly required; this fact and the desirability of
having available a greater number of percentage levels* are reasons for the issue
of the present tables giving five significant figures for x. They may be regarded
as a supplement to the original 1934 Tables (6).

Conversion from x to z or to Snedecor’s(7) F, where

$$F = \frac{\nu_2 S_1}{\nu_1 S_2} = \frac{p(1-x)}{qx}, \qquad (6)$$

is straightforward. Tables of the seven percentage levels for F have, in fact,
been already computed, and it is hoped to include them in a new edition of
Tables for Statisticians and Biometricians.
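
The conversions implied by (2), (3), (4) and (6) amount to a few lines of arithmetic. The Python sketch below is an added illustration; the function names and the numerical sums of squares are hypothetical, and the relations are taken in the form x = S₂/(S₁ + S₂), ν₁ = 2q, ν₂ = 2p.

```python
import math

def x_from_sums(S1, S2):
    """The Beta variable of (3) from two sums of squares."""
    return S2 / (S1 + S2)

def x_to_F(x, p, q):
    """Snedecor's F from x, as in (6)."""
    return p * (1.0 - x) / (q * x)

def x_to_z(x, p, q):
    """Fisher's z from x: half the natural logarithm of F."""
    return 0.5 * math.log(x_to_F(x, p, q))

# Hypothetical analysis of variance: S1 on 3 and S2 on 20 degrees of freedom.
S1, nu1 = 30.0, 3
S2, nu2 = 40.0, 20
p, q = nu2 / 2, nu1 / 2            # relation (4)
x = x_from_sums(S1, S2)
F = x_to_F(x, p, q)
z = x_to_z(x, p, q)
print(x, F, z)
```

The value of F obtained in this way coincides with the ratio of the two mean squares, (S₁/ν₁)/(S₂/ν₂), which offers a simple check on the direction of the conversion.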

Since the completion of the marginal columns for the tables of F involved some 
fresh computation, it seemed useful to extend the work so as to provide new 
tables giving thirteen percentage levels for χ². These tables are printed in a
separate contribution on pp. 187-191 below; they have been calculated to six
significant figures and cover the range of degrees of freedom v = 1(1)30 and 
40(10)100. 

A word of comment is perhaps desirable as to the introduction of the notation
ν, ν₁ and ν₂ for degrees of freedom in place of the customary n, n₁ and n₂. The use
of the letter n, both with and without a subscript, to denote a group frequency has
been so long established in publications associated with Biometrika and elsewhere
that it seemed desirable in these tables to avoid confusion by adopting a fresh
symbol for degrees of freedom. The letter f has sometimes been used, but the
notation is not altogether satisfactory; the letter ν is that employed by Yule &
Kendall ((8), p. 415), and its use here should be free from any ambiguity.

Reference has been made above to the existence of problems where the direct
requirement is for the percentage levels of x rather than z or F. A case in point is
that of the multiple correlation coefficient in samples from uncorrelated normally
distributed material; here R² follows exactly the Beta distribution. In other
cases the distribution may be used to give an approximate fit to probability
functions whose exact equations are either unknown or difficult to handle. Thus
in his Preface to the Tables of the Incomplete Beta-function, Karl Pearson stated
that his first interest in the function was stimulated by the discovery of how
accurately it could be made to graduate a hypergeometric distribution. The


* The percentage levels tabulated are: 50, 25, 10, 5, 2.5, 1 and 0.5.


fitting was carried out by equating the first four moments of the Beta and hyper- 
geometric distributions. 


Again, if a random variable can assume only values between 0 and 1, if it has
a mean value of μ₁′ and a second moment about zero of μ₂′, then the probability law

$$f(x) = \frac{\Gamma(p+q)}{\Gamma(p)\,\Gamma(q)}\,x^{p-1}(1-x)^{q-1}, \qquad (7)$$

where

$$p = \frac{\mu_1'(\mu_1' - \mu_2')}{\mu_2' - \mu_1'^2}, \qquad q = \frac{(1-\mu_1')(\mu_1' - \mu_2')}{\mu_2' - \mu_1'^2}, \qquad (8)$$

will often give a very close approximation to the true law. Use has been made of
this fact by Neyman & Pearson(5), Bishop(1) and others in determining
probability levels for the likelihood ratio criterion L₁ used in testing the homogeneity
of a series of variances and covariances. The accompanying tables are directly 
applicable in such problems. 
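
The moment fit of equation (8) is plain arithmetic. A minimal sketch in Python, assuming the two moments are supplied as ordinary floats; the function name `beta_from_moments` is illustrative and does not come from the paper:

```python
def beta_from_moments(m1, m2):
    """Return (p, q) of the Beta law on (0, 1) with mean m1 and second moment about zero m2."""
    var = m2 - m1 * m1              # variance = mu2' - mu1'^2
    if not (0.0 < m1 < 1.0) or var <= 0.0 or m2 >= m1:
        raise ValueError("moments are not those of a distribution on (0, 1)")
    common = (m1 - m2) / var        # this ratio equals p + q
    return m1 * common, (1.0 - m1) * common

# Beta(2, 3) has mean 2/5 and second moment about zero 1/5.
p, q = beta_from_moments(0.4, 0.2)
```

With these inputs the fit should recover p = 2 and q = 3 up to rounding, which gives a quick check on the signs in (8).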

Miss Catherine M. Thompson (now Mrs V. G. Grylls) has been responsible for 
by far the greater part of the numerical work involved in the production, and the 
tables should rightly be associated with her name. Owing to the special character 
of the Beta-distribution, which makes it necessary to vary the method of com- 
putation in different parts of the range of variables covered, a considerable 
amount of exploratory work and some careful planning was needed in the 
development of the lines of attack. This essential aid has been provided by 
Drs L. J. Comrie and H. O. Hartley of Scientific Computing Service Ltd., in 
whose office Miss Thompson carried out most of the work. Since the evacuation 
of University College at the beginning of the war this help, both in advice and in 
accommodation, has been more than ever essential. In the following pages 
Drs Comrie and Hartley have described the various methods used in computation 
and have also discussed the problem of interpolation. 

The Editor is glad to take this opportunity of expressing his warm apprecia- 
tion of this collaboration, which has made it possible to carry through to a suc- 
cessful conclusion a piece of work that had long been in view. 


REFERENCES 


(1) Bishop, D. J. (1939). Biometrika, 31, 31-55.

(2) Colcord, C. G. & Deming, L. S. (1935). Sankhyā, 2, 423.

(3) Fisher, R. A. (1925). Statistical Methods for Research Workers. London: Oliver and Boyd.

(4) Fisher, R. A. & Yates, F. (1938). Statistical Tables for Biological, Agricultural and
Medical Research. London: Oliver and Boyd.

(5) Neyman, J. & Pearson, E. S. (1931). Bull. int. Acad. Cracovie. Série A, pp. 460-81.

(6) Pearson, K. (1934). Tables of the Incomplete Beta-function. London: Biometrika.

(7) Snedecor, G. W. (1934). Calculation and Interpretation of Analysis of Variance and
Covariance. Ames, Iowa: Collegiate Press Inc.

(8) Yule, G. U. & Kendall, M. G. (1937). Introduction to the Theory of Statistics. 11th ed.
revised. London: C. Griffin and Co.













154 Percentage points of the incomplete beta-function 


DESCRIPTION OF THE CALCULATION 
By L. J. COMRIE and H. O. HARTLEY


INTRODUCTION 
In terms of Karl Pearson’s notation the incomplete Beta-function B_x(p, q) is
defined by the integral

$$B_x(p,q) = \int_0^x x^{p-1}(1-x)^{q-1}\,dx. \qquad (1)$$

For x = 1 we have the complete Beta-function B_1(p, q), commonly denoted by
B(p, q) and defined by the equation

$$B(p,q) = \frac{\Gamma(p)\,\Gamma(q)}{\Gamma(p+q)}, \qquad (2)$$

which is identical with (1) for x = 1.

The tables give the percentage points of the ‘normalized’ incomplete Beta-
function

$$I_x(p,q) = \frac{\Gamma(p+q)}{\Gamma(p)\,\Gamma(q)} \int_0^x x^{p-1}(1-x)^{q-1}\,dx. \qquad (3)$$

They are defined as the roots x of the equation

$$I_x(p,q) = P \qquad (4)$$

for given P, as functions of the parameters p and q. Seven tables have been
prepared corresponding to seven selected values of P, namely 0.005, 0.01, 0.025,
0.05, 0.10, 0.25 and 0.50. From the formula

$$I_x(p,q) = 1 - I_{1-x}(q,p) \qquad (5)$$

the roots of (4) follow immediately for P = 0.75, 0.90, 0.95, 0.975, 0.99 and
0.995.

In each table x is tabulated for

ν₁ = 2q = 1(1)10, 12, 15, 20, 24, 30, 40, 60, 120 and ∞,
ν₂ = 2p = 1(1)30, 40, 60, 120 and ∞.

With 2q as column headings and 2p as row headings the arrangement of the
tables corresponds to that of the upper percentage points of R. A. Fisher’s z and
G. Snedecor’s F, ν₁ and ν₂ being the degrees of freedom.
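
Definitions (3) to (5) can be tried out numerically. The sketch below is not one of the methods used for the tables: it evaluates I_x(p, q) by a standard continued fraction and solves (4) by bisection, both of which are swapped-in modern techniques. The function names `inc_beta` and `percentage_point` are illustrative assumptions.

```python
import math

def _continued_fraction(a, b, x, eps=1e-15, tiny=1e-300):
    """Continued fraction for the incomplete beta integral (modified Lentz scheme)."""
    c = 1.0
    d = 1.0 - (a + b) * x / (a + 1.0)
    d = 1.0 / (d if abs(d) > tiny else tiny)
    h = d
    for m in range(1, 500):
        even = m * (b - m) * x / ((a + 2 * m - 1) * (a + 2 * m))
        odd = -(a + m) * (a + b + m) * x / ((a + 2 * m) * (a + 2 * m + 1))
        for term in (even, odd):
            d = 1.0 + term * d
            c = 1.0 + term / c
            d = 1.0 / (d if abs(d) > tiny else tiny)
            c = c if abs(c) > tiny else tiny
            delta = d * c
            h *= delta
        if abs(delta - 1.0) < eps:
            break
    return h

def inc_beta(p, q, x):
    """Normalized incomplete Beta-function I_x(p, q) of equation (3)."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    front = math.exp(math.lgamma(p + q) - math.lgamma(p) - math.lgamma(q)
                     + p * math.log(x) + q * math.log1p(-x))
    if x < (p + 1.0) / (p + q + 2.0):
        return front * _continued_fraction(p, q, x) / p
    return 1.0 - front * _continued_fraction(q, p, 1.0 - x) / q   # uses formula (5)

def percentage_point(p, q, P, tol=1e-12):
    """Root x of I_x(p, q) = P, found by bisection on (0, 1)."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if inc_beta(p, q, mid) < P:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

Under these assumptions an upper point follows from a lower one as in (5): the root for P = 0.95 with parameters (p, q) is 1 minus the root for P = 0.05 with parameters (q, p).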

Karl Pearson in his introduction to the Tables of the Incomplete Beta-function (6) 
says: ‘No single method has hitherto been discovered for evaluating numerically 
the incomplete Beta-function for all values of p and q.’ Those who have done
numerical work on this function and its various transformations will agree that
the main difficulty is the limitation in scope of any single method and the variety
of methods required to deal appropriately with the range of the parameters p
and q and of the variable x. This difficulty is enhanced when the task is the
calculation of percentage points of x rather than the tabulation of the function
I_x(p, q). A large number of numerical processes, each specially designed to deal
with certain ranges of the new tables, had to be employed to accomplish this task.

The choice of a suitable method is largely determined by three important 
factors: 

(a) The accuracy required for x. This was fixed at five significant figures. 

(b) Existing tables available as a starting point. These are: Karl Pearson’s
Tables of the Incomplete Beta-function(6), Fisher’s tables of percentage points
of z(4), and corresponding tables of its transformation F(3), and finally Karl
Pearson’s Tables of the Incomplete Γ-function(5).

(c) The number and relative position of percentage levels, P, and the values of 
p and q for which the percentage points x are to be calculated. 

The importance of (a) and (b) is obvious, but (c) is no less relevant. It will, as
a rule, be uneconomical to produce an interpolable table of I_x(p, q) merely to
obtain a single percentage point by inverse interpolation. If, however, a larger
number of percentage levels is to be calculated the method becomes worth while.
In this connexion it should be remarked that the original plan was to produce
tables for P = 0.005, 0.01, 0.05 and 0.10 only, and that it was decided at a later
stage to add tables for the remaining values of P.


SUMMARY OF NUMERICAL METHODS EMPLOYED 


(1) Inverse interpolation in Karl Pearson’s tables. A large number of per-
centage points were obtained by inverse interpolation in the tables of I_x(p, q).
The particular tables required were differenced on a National machine(2) and
six significant figures of x found by the method of inverse interpolation described
by L. J. Comrie(2), taking into consideration the higher order differences. The
method breaks down when for large p and small q (or for large q and small p) the
tabular interval of 0.01 is too wide for the tables to be interpolable. With the
notation adopted, the difficulty arises in the top right-hand corner of the tables
when for large q and small p the root x is smaller than 0.05. Roughly speaking,
roots greater than 0.05 could be obtained from Pearson’s tables.

(2) Interpolation in tables of percentage points. It will be noted that in the
present tables percentage points are given for certain values of p and q for which
the function I_x(p, q) has not been tabulated by Pearson. Such points occur in the
rows 2p = 23(2)29, 120 and in the column 2q = 120. Whilst the calculation of the
two marginal lines (2q = 120 and 2p = 120) necessitated special methods, the
extra entries in the interior of the table were easily obtained by p-wise inter-
polation, using suitable formulae of the Lagrangian type.

(3) Extension of the tables of I_x(p, q). The well-known recurrence formula

$$I_x(p,q) = x\,I_x(p-1,q) + (1-x)\,I_x(p,q-1) \qquad (6)$$

is particularly convenient if I_x(p, q) is required for a lattice work of integer
co-ordinates 2p, 2q in a certain range and for a limited range of x. For small p
and large or moderate q all percentage points of I_x(p, q) are clustered near 0. It
appeared worth while, therefore, to use the recurrence formula (6) to extend
Pearson’s table by producing I_x(p, q) for

2p = 1(1)7, 2q = 1(1)30 and x = 0.001(0.001)0.012, 0.015 and 0.025

in order to obtain more of the required percentage levels x by inverse interpolation
at intervals 0.005 and 0.001 respectively.* This made it possible to obtain a
large number of percentage points x in a range where Pearson’s tables are not
interpolable.


To start the recurrence process, the functions

I_x(½, q), I_x(p, ½), I_x(1, q) and I_x(p, 1)

are required for the above ranges of p and q. The first two functions were obtained
from the expansions

$$I_x(\tfrac12,q) = \frac{1}{B(\tfrac12,q)}\left\{2x^{1/2} - \frac{2(q-1)}{3\cdot 1!}\,x^{3/2} + \frac{2(q-1)(q-2)}{5\cdot 2!}\,x^{5/2} - \frac{2(q-1)(q-2)(q-3)}{7\cdot 3!}\,x^{7/2} + \dots\right\}, \qquad (7)$$

$$I_x(p,\tfrac12) = \frac{1}{B(p,\tfrac12)}\left\{\frac{x^p}{p} + \frac{1\cdot x^{p+1}}{2(p+1)} + \frac{1\cdot 3\,x^{p+2}}{2\cdot 4\,(p+2)} + \frac{1\cdot 3\cdot 5\,x^{p+3}}{2\cdot 4\cdot 6\,(p+3)} + \dots\right\}. \qquad (8)$$

In dealing with the expansion of I_x(½, q), all terms required for 10-decimal
accuracy in I_{0.025}(½, q) were first produced and then reduced for smaller values of
x by multiplying by the appropriate power of x/0.025. The quantities

$$I_x(1,q) = 1 - (1-x)^q, \qquad I_x(p,1) = x^p$$

were produced with the help of logarithmic tables. The remaining functions

I_x(1.5, q), I_x(2.5, q), I_x(3.5, q) and I_x(2, q), I_x(3, q)

were then obtained by four recurrences covering the following combinations of
the parameters 2p and 2q:

Odd values of 2p and odd values of 2q
Even values of 2p and odd values of 2q
Odd values of 2p and even values of 2q
Even values of 2p and even values of 2q
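
The recurrence (6) is easy to mechanize. A minimal sketch for the even-even case only (integer p and q), starting from the closed forms for I_x(1, q) and I_x(p, 1); the half-integer cases would need the expansions (7) and (8) as starting values and are not covered here. The name `lattice_by_recurrence` is illustrative.

```python
def lattice_by_recurrence(x, max_p, max_q):
    """I_x(p, q) for integer 1 <= p <= max_p, 1 <= q <= max_q, built with recurrence (6)."""
    table = {}
    for q in range(1, max_q + 1):
        table[(1, q)] = 1.0 - (1.0 - x) ** q          # starting column I_x(1, q)
    for p in range(1, max_p + 1):
        table[(p, 1)] = x ** p                        # starting row I_x(p, 1)
    for p in range(2, max_p + 1):
        for q in range(2, max_q + 1):
            table[(p, q)] = x * table[(p - 1, q)] + (1.0 - x) * table[(p, q - 1)]
    return table

# One of the small arguments mentioned in the text.
values = lattice_by_recurrence(0.025, 3, 15)
```

A simple check is that the entry for p = q = 2 agrees with the closed form 3x² - 2x³.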


Having thus dealt with the main body of the tables we now turn to the more 
difficult problem of calculating entries x near the margin of each table of per- 
centage points. In what follows, methods will differ according to whether 2p is 
odd or even. 


* [It is hoped that at a future date it will be possible to publish these extended values of
I_x(p, q) as a supplement to the Tables of the Incomplete Beta-function. Ed.]




(4) Building up the polynomial part of I_x(p, q) from a constant high-order
difference (2p even). For integer p,

$$B_x(p,q) = \int_0^x x^{p-1}(1-x)^{q-1}\,dx \qquad (9)$$

may be expressed as a polynomial in (1 - x). We have

$$B(p,q) - B_x(p,q) = (1-x)^q \sum_{i=0}^{p-1} (-)^i \binom{p-1}{i} \frac{(1-x)^i}{q+i}, \qquad (10)$$

or, introducing y = 1 - x,

$$B_y(q,p) = y^q \sum_{i=0}^{p-1} (-)^i \binom{p-1}{i} \frac{y^i}{q+i}. \qquad (11)$$

B_y(q, p) is therefore the product of the qth power of y and a simple polynomial in y.
If p is small (2p ≤ 12) this polynomial can be built up easily on the National
machine(2). As an example, for 2q = 60 and 2p = 6 we have the equation

14880{B(3, 30) - B_x(3, 30)} = 14880 B_y(30, 3) = y³⁰(496 - 960y + 465y²).

The polynomial 496 - 960y + 465y² was built up on the National machine from
its constant second difference for values of y beginning at y = 1 and descending
at interval 0.001. The polynomial values were multiplied by the 30th power of
the argument and the products checked by differencing. Finally the percentage
points y (or x) were found by inverse interpolation. This method was used for
2q = 40, 60, 120 and 2p = 2, 4, 6, 8, 10, 12. For larger values of p the building
up process becomes too laborious. On the other hand, with increasing p the
percentage points x increase in value, so that for values of 2p greater than 12 and
not exceeding 100 results could be obtained from Pearson’s tables by inverse
interpolation.
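
The worked case 2q = 60, 2p = 6 can be reproduced directly from (11). The sketch below uses exact fractions for the coefficients and, in place of the inverse interpolation on a differenced table described above, a plain bisection (a swapped-in technique). Since B(3, 30) = 1/14880, the polynomial expression equals 1 - I_x(3, 30). The function names are illustrative.

```python
from fractions import Fraction
from math import comb

def poly_coeffs(p, q):
    """Coefficients of y^i in B_y(q, p) / y^q for integer p, from equation (11)."""
    return [Fraction((-1) ** i * comb(p - 1, i), q + i) for i in range(p)]

# Scaled coefficients for 2p = 6, 2q = 60.
coeffs = [14880 * c for c in poly_coeffs(3, 30)]

def tail(y):
    """1 - I_x(3, 30) = y^30 (496 - 960 y + 465 y^2), with y = 1 - x."""
    return y ** 30 * (496 - 960 * y + 465 * y * y)

def percentage_point_3_30(P, tol=1e-12):
    """Root x of I_x(3, 30) = P; tail(y) rises monotonically from 0 to 1 on (0, 1)."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if tail(mid) < 1.0 - P:
            lo = mid
        else:
            hi = mid
    return 1.0 - 0.5 * (lo + hi)
```

The scaled coefficients come out as 496, -960 and 465, in agreement with the polynomial quoted in the text.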

(5) Taylor expansion at approximate percentage point (2p even). It remains,
therefore, to consider the last column 2q = 120 for 2p ≥ 14. For small x (i.e. values
of y in the neighbourhood of 1) the terms of the expansion (11) have to be calculated
to a very high degree of accuracy, since these terms have alternating signs and
many significant figures are lost when adding to produce B_y(q, p), which is very
small. Since seven significant figures are required for B_y(q, p), in some cases 20
decimals are required for the terms in (11), and their computation becomes
laborious. A method was, therefore, evolved whereby B_y(q, p) has to be calculated
for one single three-decimal argument only. Although the function B_y(q, p) is
difficult to compute, its derivatives are easily calculated. It is, therefore, natural
to use a Taylor expansion

$$B_{y+h}(q,p) - B_y(q,p) = h\,y^{q-1}(1-y)^{p-1}\left\{1 + \frac{h}{2}\left(\frac{q-1}{y} - \frac{p-1}{1-y}\right) + \dots\right\}. \qquad (12)$$

With a known* three-decimal approximation y to the exact percentage point
y + h the main task consists in the calculation of B_y(q, p). This was done from
formula (11), using tables of powers(9), and a high capacity electric calculating
machine. The correction h to the approximation y was then easily obtained by
iteration from equation (12). With h₀ = 0 the iteration is given by

$$h_{i+1} = \Delta B\left\{1 + \tfrac12 h_i\left(\frac{q-1}{y} - \frac{p-1}{1-y}\right)\right\}^{-1}, \qquad (13)$$

where

$$\Delta B = \frac{B_{y+h}(q,p) - B_y(q,p)}{y^{q-1}(1-y)^{p-1}}.$$

Since the numerator, ΔB, of equation (13) does not vary, the corrections h₁, h₂
and h₃ are easily produced in turn, three steps being sufficient in most cases.
Occasionally the term arising from the third derivative had to be included in the
denominator. In this way the percentage points were calculated for

2p = 14(2)22, 2q = 120,
2q = 14(2)22, 2p = 120,

the values for 2q = 14, 16, 18, 22 and 2p = 120 being required for checking by
differencing.

* It will be shown later how this approximation was obtained.
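
The iteration can be sketched in a few lines. The code assumes the reading of (13) in which the fixed quantity ΔB is the required increment of B divided by y^(q-1)(1-y)^(p-1) and each step divides ΔB by the second-order Taylor factor; the small case B_y(3, 2) = y³/3 - y⁴/4 is used only because it has a closed form, and all names are illustrative.

```python
def taylor_correction(delta_b, y, p, q, steps=3):
    """Iterate h <- delta_b / (1 + h/2 * ((q-1)/y - (p-1)/(1-y))), starting from h = 0."""
    slope = (q - 1.0) / y - (p - 1.0) / (1.0 - y)
    h = 0.0
    for _ in range(steps):
        h = delta_b / (1.0 + 0.5 * h * slope)
    return h

def B(y):
    """Closed form of B_y(3, 2), the integral of t^2 (1 - t) from 0 to y."""
    return y ** 3 / 3.0 - y ** 4 / 4.0

y0, target_y = 0.50, 0.51                      # approximation and "exact" point
# Required increment divided by y^(q-1) (1-y)^(p-1) with q = 3, p = 2.
delta_b = (B(target_y) - B(y0)) / (y0 ** 2 * (1.0 - y0))
h = taylor_correction(delta_b, y0, p=2, q=3)   # should be close to 0.01
```

Because only the second-order term is kept, the recovered h differs from 0.01 by roughly the size of the neglected third-derivative term, which is the situation in which the text mentions adding that term to the denominator.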
A word has to be added concerning the three-decimal approximation to the
percentage points of x for 2q = 120 and 2p = 120. In some cases such values
could be obtained from Fisher’s table of percentage points of z using the
transformation

$$x = \frac{p}{p + q\,e^{2z}}.$$

In cases where such values are not available they were obtained by harmonic
interpolation. More precisely, the finite limits

$$\lim_{\substack{q\to\infty \\ p=\text{constant}}} 2qx \qquad\text{and}\qquad \lim_{\substack{p\to\infty \\ q=\text{constant}}} 2p(1-x)$$

were first obtained from the functions I(u, p) of Tables of the Incomplete Γ-function.
The above limits depend on p and q and are given by 2u√p and 2u√q respectively,
where u is the root of I(u, p-1) = P and I(u, q-1) = 1 - P respectively. The
quantity 2qx, being known for the arguments 1/2q = 0/120, 2/120, 3/120, 4/120,
5/120 and 6/120, was then obtained (to about four-decimal accuracy) for the
argument 1/2q = 1/120 by a Lagrangian interpolation formula. Similarly the
quantity 2p(1 - x) was calculated for 1/2p = 1/120.
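
The harmonic interpolation step can be illustrated in the one case where everything is available in closed form, p = 1, for which I_x(1, q) = 1 - (1 - x)^q gives x = 1 - (1 - P)^(1/q) and the limit of 2qx is -2 ln(1 - P). This special case is chosen purely for illustration and is not one treated in the paper; the function names are assumptions.

```python
import math

def lagrange_weights(nodes, t):
    """Lagrangian coefficients for interpolating to argument t from the given nodes."""
    return [math.prod((t - b) / (a - b) for b in nodes if b != a) for a in nodes]

def two_q_x(P, t):
    """2q*x for p = 1 as a function of t = 120/2q; the limiting value is used at t = 0."""
    if t == 0:
        return -2.0 * math.log1p(-P)
    q = 60.0 / t
    return 2.0 * q * (1.0 - (1.0 - P) ** (1.0 / q))

P = 0.05
nodes = [0, 2, 3, 4, 5, 6]                     # arguments 120/2q at which 2qx is known
weights = lagrange_weights(nodes, 1)           # interpolate to 120/2q = 1, i.e. 2q = 120
interp = sum(w * two_q_x(P, t) for w, t in zip(weights, nodes))
exact = two_q_x(P, 1)
```

Here 2qx is almost linear in 1/2q, so the interpolate and the directly computed value agree far more closely than the four decimals mentioned in the text; for general p the agreement would be less close.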

We are left, therefore, to consider the entries in the column 2q = 120 with
2p odd, and in the row 2p = 120 with 2q odd, and also certain entries in the top
right-hand corner for 2q > 30 and odd 2p < 13.

(6) Binomial expansion with fractional index (2p odd). In the top right-hand
corner values of x are small and it is therefore to be expected that the expansion

$$B(p,q)\,P = \sum_{i=0}^{\infty} (-)^i c_i\,x^{p+i} \qquad (14)$$

is reasonably convergent for such values of x.

The coefficients of the expansion (14) were calculated for 2q = 1(1)30, 40, 60
and 120, and 2p = 1(1)9. The root x of the equation (14) was then found by a
suitable iteration process.

In some cases (2p = 1, 1 ≤ 2q ≤ 30) it was found convenient to invert the
expansion (14). With 2p = 1 the equation (14), if regarded as an expansion in
powers of √x, may be reversed to yield √x as an expansion in powers of B(p, q) × P.
Because of the particular importance of the case 2p = 1 (the t-distribution) it is
of interest to give here examples of formulae from which any percentage point x
may be obtained directly by substituting the corresponding percentage level P.

If D = ½B(p, q)P = ½B_x(p, q), the first five terms of the reversed expansion
are as follows:

$$\sqrt{x} = D + \frac{q-1}{3}D^3 + \frac{(q-1)(7q-4)}{30}D^5 + \frac{(q-1)(127q^2-131q+34)}{630}D^7 + \frac{(q-1)(4369q^3-6285q^2+3042q-496)}{22680}D^9 + \dots,$$

from which the expansion for any particular q may be worked out without
difficulty. Thus for q = 10,

$$\sqrt{x} = D + 3D^3 + 19.8D^5 + 163.2D^7 + 1496.2D^9 + \dots,$$

and for q = 25,

$$\sqrt{x} = D + 8D^3 + 136.8D^5 + 2900.3D^7 + 68162.0D^9 + \dots.$$
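
The numerical coefficients quoted for q = 10 and q = 25 give a check on the general terms of the reversed series. The sketch below evaluates the five coefficients with exact fractions, taking the polynomials in q as they are implied by those two numerical examples; the function names are illustrative.

```python
from fractions import Fraction

def reversed_series_coeffs(q):
    """Coefficients of D, D^3, D^5, D^7, D^9 in the reversed expansion of sqrt(x) for 2p = 1."""
    q = Fraction(q)
    return [Fraction(1),
            (q - 1) / 3,
            (q - 1) * (7 * q - 4) / 30,
            (q - 1) * (127 * q**2 - 131 * q + 34) / 630,
            (q - 1) * (4369 * q**3 - 6285 * q**2 + 3042 * q - 496) / 22680]

def sqrt_x_from_D(q, D):
    """Five-term approximation to sqrt(x) for a given D = B_x(1/2, q) / 2."""
    return sum(float(c) * D ** (2 * k + 1)
               for k, c in enumerate(reversed_series_coeffs(q)))

coeffs_q10 = [round(float(c), 1) for c in reversed_series_coeffs(10)]
coeffs_q25 = [round(float(c), 1) for c in reversed_series_coeffs(25)]
```

For q = 2 the forward relation is simply D = √x - (√x)³/3, so feeding that D back through `sqrt_x_from_D` gives a direct test of the reversal for small x.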


(7) Numerical integration. For large p and q, when the integral I_x(p, q)
approaches the normal probability integral, a variety of methods has been
developed (6, 7, 8). With mechanical computing aids available, numerical integra-
tion appeared to be the simplest. The integrand x^(p-1)(1 - x)^(q-1) represents a smooth
curve and was produced at interval 0.01 with the help of logarithmic tables and
checked by differencing on the National machine. Numerical integration was
performed by Gauss’ formula and the integral B_x(p, q) checked by differencing.
Finally, x was obtained by inverse interpolation and checked by the application
of Taylor’s expansion at the tabular value nearest to x. This method was used for
2q = 120 and 2p = 24, 26, 28, 30, 40, 60 and 120 and also for 2p = 120 and
2q = 24, 30, 40 and 60.

For 2q = 120 and 2p = 7(2)29 all percentage points were obtained by p-wise
interpolation between the entries 2p = 2(2)30, 40 and 60, using appropriate
formulae of the Lagrangian type.

(8) Approximation by the incomplete Γ-function. It remains to consider the
entries for 2p = 120 and 2q = 1(2)9.

There appears to be a lack of suitable methods for obtaining accurate results in
this range of q for isolated large values of p. Three-decimal approximations x₀ to
the percentage levels may be obtained by harmonic interpolation as described
in §(5).* To obtain the correct percentage points (x = x₀ + h), the main task
consists in calculating I_{x₀}(60, q) to six places of decimals. This was done with the
help of a recently developed approximate formula giving I_x(p, q) in terms of the
incomplete Γ-function.† This formula is akin to a Taylor expansion of I_x(p, q)
in powers of 1/2p at 1/2p = 0 (2p = ∞) and may be written as follows:

$$1 - I_x(p,q) \simeq I(u,q-1) + \frac{\lambda^q e^{-\lambda}}{\Gamma(q)}\left\{\frac{T_1}{2p} + \frac{T_2}{(2p)^2} + \frac{T_3}{(2p)^3}\right\}, \qquad (15)$$

where

$$u = \frac{\lambda}{\sqrt{q}} \qquad\text{and}\qquad \lambda = \frac{p(1-x)}{x},$$

and the terms T_i are dependent on λ and q only, the first two being

$$T_1 = q - 1 - \lambda,$$

$$T_2 = \tfrac12 q^3 - \tfrac53 q^2 + \tfrac32 q - \tfrac13 + \left(-\tfrac32 q^2 + \tfrac{11}{6} q - \tfrac13\right)\lambda + \left(\tfrac32 q - \tfrac16\right)\lambda^2 - \tfrac12\lambda^3.$$

This formula is very accurate for large p and small or moderate q. The terms T₁
and T₂ were calculated in each case for

$$\lambda = \lambda_0 = \frac{60(1-x_0)}{x_0},$$

where x₀ denotes a suitable three-decimal approximation to the true percentage
point. To obtain T₃ we make use of the fact that equation (15) should yield
I_x(50, q) to seven-decimal accuracy. Since I_x(50, q) is obtained to that accuracy
from Pearson’s table and since T₁, T₂ and T₃ depend on λ and q only, we may use
equation (15) to determine T₃ by substituting p = 50, λ = λ₀, x = x₁ = 50/(50 + λ₀),
u = λ₀/√q. With T₁, T₂ and T₃ computed, I_{x₀}(60, q) is easily obtained from
equation (15). Finally the exact percentage point x₀ + h is calculated by the
iteration process (13).
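
The behaviour of formula (15) can be explored numerically in cases where 1 - I_x(p, q) has a closed form. The sketch below is restricted to integer q, truncates (15) after T₂ (T₃ is omitted), and assumes λ = p(1 - x)/x together with coefficients of T₁ and T₂ obtained by expanding the Beta integral in powers of 1/p; these forms are a reconstruction and should be checked against Hartley's own derivation before being relied on. The incomplete Γ-function term is written as the probability that a Γ(q) variable does not exceed λ.

```python
import math

def hartley_two_term(p, q, x):
    """Approximate 1 - I_x(p, q) by (15) truncated after T2; integer q only (assumption)."""
    lam = p * (1.0 - x) / x
    # Incomplete Gamma term: closed form for integer q.
    gamma_part = 1.0 - math.exp(-lam) * sum(lam ** k / math.factorial(k) for k in range(q))
    t1 = q - 1 - lam
    t2 = (q ** 3 / 2 - 5 * q ** 2 / 3 + 3 * q / 2 - 1 / 3
          + (-3 * q ** 2 / 2 + 11 * q / 6 - 1 / 3) * lam
          + (3 * q / 2 - 1 / 6) * lam ** 2 - lam ** 3 / 2)
    front = lam ** q * math.exp(-lam) / math.gamma(q)
    return gamma_part + front * (t1 / (2 * p) + t2 / (2 * p) ** 2)

# Closed forms for comparison: I_x(p, 1) = x^p and I_x(p, 2) = x^p (p + 1 - p x).
p, x = 60, 0.95
approx_q1, exact_q1 = hartley_two_term(p, 1, x), 1.0 - x ** p
approx_q2, exact_q2 = hartley_two_term(p, 2, x), 1.0 - x ** p * (p + 1 - p * x)
```

With p = 60 the two-term form already agrees with the closed forms to about five decimals, which is consistent with the text's use of a third term, fitted at p = 50, to reach six-decimal accuracy.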
* In this case it was sufficient to use a Lagrangian formula for interpolation between values of x
(with x = 1 for 2p = ∞), instead of performing the more complicated interpolation between values
of 2p(1 - x).

† The derivation of this formula is given in a paper by H. O. Hartley which will, it is hoped,
be published in the next issue of Biometrika.


CHECKS

The main body of each table of percentage points (i.e. the interior of each table)
was checked by differencing p-wise and q-wise at interval ½. For large q and
moderate p, x may be considered as a function of 1/2q and differenced at interval
1/120, i.e. for 1/2q = 0/120 (1/120) 6/120. Similarly x may be differenced for large
p and moderate values of q for 1/2p = 0/120 (1/120) 6/120. Four significant figures
may be checked in this way, thus eliminating any possibility of serious errors.
For small p and large q, the quantity x is almost linear in 1/2q so that a good check
was given by examination of the product 2qx, which is almost constant.
Nevertheless, the only available check to guarantee five-decimal accuracy at the margins
was recomputation. As far as possible repetition of the method employed in the
first instance was avoided. Thus inverse interpolation was replaced by direct
interpolation or by a Taylor expansion at a tabular value; iteration processes
were varied in the formulae employed.


METHODS OF INTERPOLATION 
By H. O. HARTLEY 
INTRODUCTION 


In so far as the table is required in connexion with standard tests of significance
the user will be concerned with obtaining x for any percentage level P and for
integer values of ν₂ = 2p and ν₁ = 2q.

The values of P and the row and column headings (2p, 2q) have been selected
in such a way that the user will generally find the required value of x tabulated.
Moreover, for most of the applications it suffices to estimate roughly the magni-
tude of interpolates from an inspection of the table. In some cases, however,
interpolation to about five-decimal accuracy is necessary. The problem of inter-
polating between corresponding entries in different tables of percentage points
(interpolation P-wise) will, it is hoped, be dealt with elsewhere and we are here
only concerned with interpolation in each individual table of percentage points
x(2p, 2q) to find x for any combination of integer arguments 2p, 2q.*

In the present tables interpolation to integer arguments ν₂ = 2p, ν₁ = 2q
will occur in three different forms:

(1) Single-entry interpolation q-wise in the range 1 ≤ 2p ≤ 30 and 10 < 2q < ∞.

(2) Single-entry interpolation p-wise in the range 1 ≤ 2q ≤ 10 and 30 < 2p < ∞.

(3) Double-entry interpolation for 30 < 2p, 10 < 2q.

If both 2p and 2q are large, interpolation in the tables is impractical, and it
was therefore necessary to add a fourth section, namely:

(4) Approximate calculation of percentage points if both p and q are large.

It will be noted that, following the lay-out adopted in other tables of per-
centage points (3, 4), the column headings ν₁ = 2q = 20, 24, 30, 40, 60, 120
and ∞ are in harmonic progression. If, therefore, 1/2q is used as a variable,
these columns form a tabulation of x at equidistant intervals of the variable
1/2q. The same harmonic progression is given for the row headings ν₂ = 2p,
although here the tabulation at unit interval goes up to 2p = 30, because of
the importance of these values for certain tests of significance. The use of the
variables 1/2q and 1/2p greatly facilitates interpolation, but even with this
device (known as harmonic interpolation) high-order interpolation formulae
have to be used in many cases, if the accuracy of the tabular values is required.

* Fractional values of 2p and 2q occur in a number of applications when the percentage points
of certain Pearson-type curves are required. In such cases it will be found most convenient to
apply single-entry interpolation formulae, first in one direction (p or q) and then in the other.
Methods akin to those given here cover the range 2q > 10, 2p > 30. For the range 0 < 2p < 30,
0 < 2q < 10, successive single-entry interpolation (p-wise or q-wise) at unit interval should afford
no difficulty provided the arguments of the interpolate do not lie within the strips 0 < 2p < 3,
0 < 2q < 2. Within these strips interpolation cannot be carried out without the aid of auxiliary tables.

To facilitate single-entry interpolation, therefore, an auxiliary table of Lagran- 
gian coefficients has been prepared. Although this auxiliary table has been 
specifically designed to meet the requirements of the present tables of percentage 
points of x, it is given and described in a separate paper (p. 183) since it is felt 
that it will have a wider application to any table of percentage points with a 
similar lay-out. 


1. SINGLE-ENTRY INTERPOLATION q-WISE 


No interpolation is required for the range 1 ≤ 2q ≤ 10 (1 ≤ ν₁ ≤ 10). For
ν₁ = 2q > 10 interpolates are obtained with the help of the auxiliary table on
pp. 183-5 of this issue, and its use is best explained in terms of an example.

Example 1. Find the 5 % point corresponding to 2p = 26, 2q = 74.

In the auxiliary table (p. 185 below) enter the row headed 74, that is, the row
whose heading is equal to the value of 2q for which the interpolate is required.
The entries in this row are the (Lagrangian) multipliers in a sum of products
which yields the interpolate x. The corresponding multiplicands are taken from
the table of 5 % points x(2p, 2q). We enter the row headed 2p = 26 and select
entries x(26, 2q) for 2q = 20, 24, 30, 40, 60, 120 and ∞. These correspond to the
column headings in the auxiliary table. The sign of each product is also given at
the top of the columns. The entry for 2q = ∞ is x = 0 and so contributes nothing
to the sum. We have, therefore,

x(26, 74) = + 0.395 16 × 0.005 867 - 0.357 56 × 0.045 623 + 0.313 14 × 0.162 013
            - 0.259 66 × 0.372 737 + 0.193 79 × 1.018 370 + 0.110 24 × 0.247 951
          = 0.164 64.

This may be compared with the exact value 0.164 637 obtained by inverse
interpolation from Pearson’s tables.
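
The multipliers used in Example 1 are ordinary Lagrangian coefficients in the variable 120/2q, which takes the equidistant values 6, 5, 4, 3, 2, 1, 0 at the tabulated columns. A minimal sketch that recomputes them rather than reading them from the auxiliary table; the function name `harmonic_weights` is illustrative, and the tabular values are those quoted in the example.

```python
import math

def harmonic_weights(two_q, columns=(20, 24, 30, 40, 60, 120, math.inf)):
    """Lagrangian coefficients in the variable t = 120/2q for the tabulated columns."""
    nodes = [120.0 / c for c in columns]              # 6, 5, 4, 3, 2, 1, 0
    t = 120.0 / two_q
    return [math.prod((t - b) / (a - b) for b in nodes if b != a) for a in nodes]

# Tabular 5 % points x(26, 2q) quoted in Example 1; x = 0 at 2q = infinity.
x_row = [0.39516, 0.35756, 0.31314, 0.25966, 0.19379, 0.11024, 0.0]
w = harmonic_weights(74)
x_26_74 = sum(wi * xi for wi, xi in zip(w, x_row))
```

The recomputed coefficients agree with the six-decimal multipliers of the example apart from their signs, which the auxiliary table lists separately, and the sum reproduces the interpolate 0.164 64 to the accuracy of the tabular entries.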

The accuracy of the interpolates depends on 2p and 2q and (to a lesser extent)
on the percentage level P. In favourable cases, if 2p is small and 2q moderate,
the interpolate is accurate to 5 places of decimals. In the least favourable cases,
for 2p near 30 or 2q large, the fifth decimal of the interpolate may be in error. One
more example is given to demonstrate the use of the auxiliary table.

Example 2. Find the 50 % point for 2p = 11, 2q = 17.

Entering the row 17 in the auxiliary table and the row 2p = 11 in the table
of 50 % points we have

x(11, 17) = + 0.525 38 × 0.003 097 - 0.476 96 × 0.037 459 + 0.419 02 × 0.409 711
            + 0.348 45 × 1.213 958 - 0.307 07 × 0.856 212 + 0.260 64 × 0.315 162
            - 0.208 18 × 0.048 257
          = 0.387 62.




It will be noted that for 2q = 16, 17, 18 and 19 two alternative rows are given in
the auxiliary table; one (which has been used in the above example) is under the
heading ‘Harmonic’ Lagrangian coefficients. The other row contains ‘Ordinary’
Lagrangian coefficients. It is in this range of 2q that there is little to choose
between the merits of ordinary and harmonic interpolation, and the use of both
methods provides a good check. Reworking the above example and using ordinary
Lagrangian coefficients we have

x(11, 17) = - 0.553 46 × 0.306 397 + 0.525 38 × 0.780 000 - 0.476 96 × 0.983 025
            + 0.419 02 × 1.258 272 + 0.348 45 × 0.289 546 - 0.307 07 × 0.040 124
            + 0.260 64 × 0.001 728
          = 0.387 62.

There is satisfactory agreement between the two interpolates and the exact
value (obtained from Pearson’s table) which is 0.387 619.


2. SINGLE-ENTRY INTERPOLATION p-WISE


No interpolation is required for 1 ≤ 2p ≤ 30 (1 ≤ ν₂ ≤ 30). For ν₂ = 2p > 30 we
again use the auxiliary table on pp. 184, 185 of this issue. This time, however, the
argument 2p of the interpolate determines the row to be entered in the auxiliary
table, whilst column headings of this table are made to correspond to selected
rows in the table of percentage points. The method is best explained by the
following examples.

Example 3. Find the 0.5 % point corresponding to 2q = 4 and 2p = 96. In
the auxiliary table enter the row headed 96. The entries in this row are the
Lagrangian multipliers. The corresponding multiplicands are taken from the
table of 0.5 % points. We enter the column headed 2q = 4 and select entries
x(2p, 4) for 2p = 20, 24, 30, 40, 60, 120 and ∞ which correspond to the column
heading in the auxiliary table.

The sign of each product is also given at the top of the columns. We therefore
have

x(96, 4) = + 0.491 44 × 0.005 875 - 0.550 98 × 0.044 647 + 0.618 64 × 0.152 207
           - 0.695 71 × 0.318 909 + 0.783 70 × 0.558 090 + 0.884 42 × 0.669 708
           - 1.000 00 × 0.022 324
         = 0.857 94,

which differs by 2 units in the fifth decimal from the exact value (0.857 92)
obtained by inverse interpolation from Pearson’s tables.













Example 4. Find the 5 % point corresponding to 2p = 80 and 2q = 30. We
have

x(80, 30) = + 0.246 39 × 0.006 836 - 0.292 08 × 0.052 734 + 0.352 00 × 0.184 570
            - 0.433 21 × 0.410 156 + 0.548 07 × 0.922 851 + 0.720 16 × 0.369 141
            - 1.000 00 × 0.020 508
          = 0.624 69.

The exact value is 0.624 75.

Again, the accuracy of the interpolates depends on 2q, 2p and to a lesser
extent on P. Five-decimal accuracy is obtained for small 2q and moderate 2p,
whilst only 4 decimals are reliable if 2q is near 30 or 2p is large.


3. HARMONIC DOUBLE-ENTRY INTERPOLATION 
In this section we deal with interpolation in the range

30 < 2p < ∞, 10 < 2q < ∞,

provided the arguments of the interpolate are not ‘too large’. The exact meaning
of this restriction is that if the methods described below are used to find inter-
polates in the range

60 < 2p < ∞, 40 < 2q < ∞,

the results obtained are unsatisfactory. In this range the user should, therefore,
proceed on lines described in section 4.

The method is essentially double-entry interpolation between points of the
lattice work shown in Fig. 1.

[Fig. 1. Lattice of tabular points. Rows: 2p = 20, 24, 30, 40, 60, 120, ∞ (120/2p = 6, 5, 4, 3, 2, 1, 0). Columns: 2q = 10, 12, 15, 20, 30, 60, ∞ (60/2q = 6, 5, 4, 3, 2, 1, 0). Broken lines join the lattice points diagonally.]

The 5, 2½, 1 and ½ % values (i.e. the quantities x for P = 0.05, 0.025, 0.01 and
0.005) are practically linear to three-figure accuracy in the diagonal direction
indicated by the broken lines in Fig. 1.

To explain the method it will be convenient to regard the percentage points
x as functions of

$$\eta = \frac{120}{2p}, \qquad \xi = \frac{60}{2q},$$

and to introduce the notation

$$x[\eta,\xi] = x(2p,2q).$$

The relation between the arguments η, ξ and 2p, 2q is demonstrated in Fig. 1.
To obtain the interpolate x for any p, q in the above range calculate

$$\xi = \frac{60}{2q} \qquad\text{and}\qquad \eta = \frac{120}{2p}$$

and find

Ξ = integral part of ξ,
H = integral part of η,
μ = (ξ - Ξ) + (η - H) - 1.

[Fig. 2. Parallelogram with vertices [H, Ξ+1], [H+1, Ξ], [H+1, Ξ+1] and [H+2, Ξ], showing the point [η, ξ] and the auxiliary points P₁ and P₂.]

If μ is positive consider the parallelogram with vertices at [H, Ξ+1], [H+1, Ξ],
[H+1, Ξ+1] and [H+2, Ξ] (see Fig. 2). Now calculate two interpolates x₁ and
x₂ at points P₁ and P₂ from the (approximate) formulae

$$x_1 = \mu\,x[H+1,\Xi+1] + (1-\mu)\,x[H,\Xi+1],$$
$$x_2 = \mu\,x[H+2,\Xi] + (1-\mu)\,x[H+1,\Xi], \qquad (16)$$

and finally find the interpolate x[η, ξ]

$$x[\eta,\xi] = (\xi-\Xi)\,x_1 + (\Xi+1-\xi)\,x_2. \qquad (17)$$

If μ is negative the points P₁ and P₂ will be at distance μ below the points [H, Ξ+1]
and [H+1, Ξ] respectively, and we have the formulae

$$x_1 = -\mu\,x[H-1,\Xi+1] + (1+\mu)\,x[H,\Xi+1],$$
$$x_2 = -\mu\,x[H,\Xi] + (1+\mu)\,x[H+1,\Xi]$$

in place of equations (16).











Example 5. Find the 1 % point x for 2p = 42, 2q = 16.
We have

ξ = 3.75, η = 2.8571, Ξ = 3, H = 2, μ = 0.6071.

To apply formulae (16) the tabular values are taken from the table of 1 % points,
where we find to 4-decimal accuracy

x[3, 4] = x(40, 15) = 0.5140, x[4, 3] = x(30, 20) = 0.3705,
x[2, 4] = x(60, 15) = 0.6297, x[3, 3] = x(40, 20) = 0.4578.

Applying the equations (16) we obtain

x₁ = 0.6071 × 0.5140 + 0.3929 × 0.6297 = 0.5595,
x₂ = 0.6071 × 0.3705 + 0.3929 × 0.4578 = 0.4048.

Finally we calculate

x[2.857, 3.75] = x(42, 16) = 0.75 × 0.5595 + 0.25 × 0.4048 = 0.5208.

The exact value obtained from Pearson’s table is 0.5163. If higher accuracy is
required we have to improve the approximate relations (16) by adding the
second-order difference effect. If this is done we obtain

x₁ = 0.5555, x₂ = 0.4016 and x = 0.5170,

which agrees satisfactorily with the exact value. If this method is applied to the
tables of 10, 25 and 50 % points and if a similar precision is required, the second-
order difference effect should also be considered when interpolating along the
diagonals. In such cases the right-hand side of equation (17) should have four
terms.


4. CALCULATION OF PERCENTAGE POINTS IF BOTH p AND q ARE LARGE

If x is required for values of 2p and 2q in the range 2p > 60, 2q > 40, interpolation
between the tabular values is not possible because of the singularity of
x at $2p = \infty$, $2q = \infty$. In this range therefore x has to be calculated ab initio.
Certain approximate formulae for the incomplete beta-function are valid in
this range (8, 10). These formulae, whilst useful for a calculation of $P = I_x(p, q)$
as a function of x, 2p and 2q, cannot be easily inverted to yield x for the given
percentage levels P.

Auxiliary table

y = normal deviate at level P,   $\lambda = \tfrac{1}{6}(y^2 + 3)$

P                          0.50     0.25     0.10     0.05     0.025    0.01     0.005
y                          0.0000   0.6745   1.2816   1.6449   1.9600   2.3263   2.5758
$\lambda$                  0.5000   0.5758   0.7737   0.9509   1.1402   1.4020   1.6058
$\lambda - \tfrac{1}{6}$   0.3333   0.4092   0.6071   0.7843   0.9736   1.2353   1.4392
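
The entries of the auxiliary table can be recomputed from the relation $\lambda = \tfrac{1}{6}(y^2 + 3)$. The following Python sketch is offered only as a check; it assumes that y is the normal deviate exceeded with probability P, and the function name `auxiliary_row` is introduced here for illustration.

```python
from statistics import NormalDist


def auxiliary_row(P):
    """Return (y, lambda, lambda - 1/6) for a percentage level P.

    Assumption: y is the standard normal deviate exceeded with probability P.
    """
    y = NormalDist().inv_cdf(1.0 - P)
    lam = (y * y + 3.0) / 6.0
    return y, lam, lam - 1.0 / 6.0


for P in (0.50, 0.25, 0.10, 0.05, 0.025, 0.01, 0.005):
    y, lam, lam6 = auxiliary_row(P)
    print(f"{P:<6} {y:7.4f} {lam:7.4f} {lam6:7.4f}")
```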


A more convenient approximation of sufficient accuracy has recently been
given by Cochran (1), who extended a method suggested by Fisher (4). It is
essentially an approximation (by the normal distribution) to Fisher's z-transformation
of x and it involves values of the normal deviate y at the appropriate
percentage levels P. These normal deviates y together with a function of y
denoted by $\lambda$ are tabulated above for the levels P with which we are concerned
here.

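To find an approximation to x for any pair of arguments 2p, 2q in the above
range, calculate in turn the quantities

$$h = \frac{8pq}{2p + 2q}, \qquad
z = \frac{y}{\sqrt{h - \lambda}} - \left(\lambda - \tfrac{1}{6}\right)\frac{p - q}{2pq}, \qquad
x = \frac{2p}{2p + 2q\, e^{2z}}.$$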


As examples we consider two tabular values of x in order to obtain some idea of
the accuracy of the approximation.

Example 6. P = 0.01, 2p = 120, 2q = 40,

$$h = \frac{8 \times 60 \times 20}{160} = 60, \qquad
z = \frac{2.3263}{\sqrt{58.598}} - \frac{1.235\,(60 - 20)}{2 \times 60 \times 20} = 0.2833, \qquad x = 0.6299.$$

This agrees with the exact value to four decimals.

Example 7. P = 0.50, 2p = 30, 2q = 120,

$$h = \frac{8 \times 15 \times 60}{150} = 48, \qquad
z = -\frac{0.333\,(15 - 60)}{2 \times 15 \times 60} = 0.00833, \qquad x = 0.1974.$$

This differs from the exact value by about a unit in the fourth decimal.
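
The three steps of the approximation can be sketched in Python as below. The sketch is illustrative and rests on stated assumptions: the function name `approx_percentage_point` is introduced here, y is taken as the normal deviate exceeded with probability P, and the correction term is written in the form $(\lambda - \tfrac{1}{6})(p - q)/(2pq)$, chosen because it reproduces the results of Examples 6 and 7.

```python
import math
from statistics import NormalDist


def approx_percentage_point(two_p, two_q, P):
    """Normal approximation to the percentage point x for large 2p and 2q."""
    y = NormalDist().inv_cdf(1.0 - P)       # normal deviate at level P
    lam = (y * y + 3.0) / 6.0               # lambda of the auxiliary table
    p, q = two_p / 2.0, two_q / 2.0
    h = 8.0 * p * q / (two_p + two_q)
    # assumed form of the correction term (see the lead-in)
    z = y / math.sqrt(h - lam) - (lam - 1.0 / 6.0) * (p - q) / (2.0 * p * q)
    return two_p / (two_p + two_q * math.exp(2.0 * z))


print(round(approx_percentage_point(120, 40, 0.01), 4))  # Example 6
print(round(approx_percentage_point(30, 120, 0.50), 4))  # Example 7
```

Both calls agree with the values quoted in Examples 6 and 7 to within about a unit in the fourth decimal.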


REFERENCES 


(1) Cochran, W. G. (1940). Ann. Math. Statist. 11, 93.
(2) Comrie, L. J. (1936). J.R. Statist. Soc. 3, 87.
(3) Fisher, R. A. & Yates, F. (1938). Statistical Tables for Biological, Agricultural and
    Medical Research. Oliver and Boyd.
(4) Fisher, R. A. (1936). Statistical Methods for Research Workers, p. 237. Edinburgh:
    Oliver and Boyd.
(5) Pearson, K. (1922). Tables of the Incomplete Γ-function. London: Biometrika.
(6) Pearson, K. (1934). Tables of the Incomplete Beta-function. London: Biometrika.
(7) Pearson, K. (1931). Tables for Statisticians and Biometricians, Part II. London: Biometrika.
(8) Soper, H. E. (1921). Tracts for Computers, No. 7. Cambridge University Press.
(9) U.S.A. Work Program W.P.A. (1939). Table of the First Ten Powers of the Integers
    from 1 to 1000.
(10) Wishart, J. (1927). Biometrika, 19, 1.






















BETA DISTRIBUTION: 50 PER CENT POINTS FOR x









































$\nu_1 = 2q$ (columns), $\nu_2 = 2p$ (rows)

$\nu_2$   1         2         3         4         5         6         7         8         9

   1   0.50000   0.25000   0.16319   0.12061   0.095526  0.079033  0.067378  0.058711  0.052015
   2    .75000    .50000    .37004    .29289    .24214    .20630    .17966    .15910    .14276
   3    .83681    .62996    .50000    .41363    .35245    .30695    .27181    .24386    .22112
   4    .87939    .70711    .58637    .50000    .43556    .38573    .34609    .31381    .28703

   5   0.90447   0.75786   0.64755   0.56444   0.50000   0.44867   0.40684   0.37213   0.34286
   6    .92097    .79370    .69305    .61427    .55133    .50000    .45737    .42141    .39068
   7    .93262    .82034    .72819    .65391    .59316    .54263    .50000    .46355    .43205
   8    .94129    .84090    .75614    .68619    .62787    .57859    .53645    .50000    .46818
   9    .94799    .85724    .77888    .71297    .65714    .60932    .56795    .53182    .50000

  10   0.95331   0.87055   0.79775   0.73555   0.68214   0.63588   0.59546   0.55984   0.52824
  11    .95765    .88159    .81366    .75484    .70376    .65907    .61968    .58471    .55346
  12    .96125    .89090    .82725    .77151    .72262    .67948    .64116    .60692    .57613
  13    .96429    .89885    .83899    .78606    .73923    .69759    .66035    .62687    .59661
  14    .96689    .90572    .84924    .79887    .75396    .71376    .67760    .64490    .61520

  15   0.96913   0.91172   0.85827   0.81023   0.76712   0.72830   0.69318   0.66127   0.63216
  16    .97109    .91700    .86627    .82038    .77894    .74143    .70732    .67620    .64768
  17    .97282    .92169    .87342    .82950    .78963    .75334    .72022    .68986    .66195
  18    .97435    .92587    .87985    .83774    .79932    .76421    .73203    .70242    .67511
  19    .97572    .92964    .88565    .84522    .80817    .77417    .74288    .71401    .68728

  20   0.97695   0.93303   0.89092   0.85204   0.81626   0.78331   0.75289   0.72472   0.69858
  21    .97806    .93612    .89573    .85828    .82370    .79175    .76215    .73467    .70909
  22    .97907    .93893    .90013    .86402    .83057    .79955    .77074    .74392    .71889
  23    .97999    .94151    .90417    .86931    .83692    .80679    .77873    .75254    .72805
  24    .98083    .94387    .90790    .87421    .84281    .81353    .78618    .76061    .73663

  25   0.98161   0.94606   0.91135   0.87875   0.84828   0.81981   0.79315   0.76816   0.74469
  26    .98232    .94808    .91455    .88298    .85340    .82568    .79968    .77526    .75227
  27    .98298    .94995    .91753    .88692    .85817    .83118    .80581    .78193    .75941
  28    .98360    .95170    .92031    .89060    .86265    .83635    .81157    .78821    .76615
  29    .98417    .95332    .92290    .89406    .86685    .84120    .81701    .79415    .77253

  30   0.98470   0.95484   0.92534   0.89730   0.87080   0.84578   0.82214   0.79976   0.77856
  40    .98855    .96594    .94324    .92136    .90038    .88030    .86107    .84266    .82501
  60    .99238    .97716    .96164    .94645    .93166    .91731    .90338    .88985    .87672
 120    .99620    .98851    .98056    .97264    .96482    .95710    .94951    .94202    .93465
   ∞   1.00000   1.00000   1.00000   1.00000   1.00000   1.00000   1.00000   1.00000   1.00000











This table gives the values of x for which $I_x(p, q) = 0.50$ where $p = \tfrac{1}{2}\nu_2$, $q = \tfrac{1}{2}\nu_1$.
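
For comparison with the tabulated values, the defining relation $I_x(p, q) = P$ can be inverted numerically. The Python sketch below is not the method by which these tables were computed; it assumes a standard continued-fraction evaluation of the incomplete beta-function followed by simple bisection, and the function names `reg_inc_beta` and `percentage_point` are introduced here for illustration only.

```python
import math


def _beta_cf(a, b, x, max_iter=500, eps=1e-14):
    """Continued fraction for the incomplete beta-function (modified Lentz)."""
    tiny = 1e-300
    qab, qap, qam = a + b, a + 1.0, a - 1.0
    c = 1.0
    d = 1.0 - qab * x / qap
    d = 1.0 / (d if abs(d) > tiny else tiny)
    h = d
    for m in range(1, max_iter + 1):
        m2 = 2 * m
        # even step of the recurrence
        aa = m * (b - m) * x / ((qam + m2) * (a + m2))
        d = 1.0 + aa * d
        d = 1.0 / (d if abs(d) > tiny else tiny)
        c = 1.0 + aa / c
        c = c if abs(c) > tiny else tiny
        h *= d * c
        # odd step of the recurrence
        aa = -(a + m) * (qab + m) * x / ((a + m2) * (qap + m2))
        d = 1.0 + aa * d
        d = 1.0 / (d if abs(d) > tiny else tiny)
        c = 1.0 + aa / c
        c = c if abs(c) > tiny else tiny
        delta = d * c
        h *= delta
        if abs(delta - 1.0) < eps:
            break
    return h


def reg_inc_beta(x, a, b):
    """Regularized incomplete beta-function I_x(a, b)."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    front = math.exp(math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
                     + a * math.log(x) + b * math.log1p(-x))
    if x < (a + 1.0) / (a + b + 2.0):
        return front * _beta_cf(a, b, x) / a
    return 1.0 - front * _beta_cf(b, a, 1.0 - x) / b


def percentage_point(nu1, nu2, P, tol=1e-12):
    """Solve I_x(p, q) = P for x by bisection, with p = nu2/2 and q = nu1/2."""
    p, q = nu2 / 2.0, nu1 / 2.0
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if reg_inc_beta(mid, p, q) < P:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


print(round(percentage_point(nu1=2, nu2=3, P=0.50), 5))  # compare the entry for nu2 = 3, nu1 = 2
```

For $\nu_1 = 2$ the relation reduces to $x^{p} = P$, so the entry for $\nu_2 = 3$ is $0.5^{2/3}$, which gives a simple independent check on the sketch.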

















































BETA DISTRIBUTION: 50 PER CENT POINTS FOR x

$\nu_1 = 2q$ (columns), $\nu_2 = 2p$ (rows)

$\nu_2$   10         12         15         20         24         30         40         60         120

   1   0.046687   0.038746   0.030867   0.023052   0.019168   0.015301   0.011450   0.0076165  0.0037997
   2    .12945     .10910     .088278    .066967    .056126    .045158    .034064    .022840    .011486
   3    .20225     .17275     .14173     .10908     .092099    .074664    .056756    .038355    .019443
   4    .26445     .22849     .18977     .14796     .12579     .10270     .078644    .053552    .027361

   5   0.31786    0.27738    0.23288    0.18374    0.15719    0.12920    0.099622   0.068335   0.035184
   6    .36412     .32052     .27170     .21669     .18647     .15422     .11970     .082690    .042896
   7    .40454     .35884     .30682     .24711     .21382     .17786     .13893     .096624    .050494
   8    .44016     .39308     .33873     .27528     .23939     .20024     .15734     .11015     .057977
   9    .47176     .42387     .36784     .30142     .26337     .22144     .17499     .12328     .065345

  10   0.50000    0.45169    0.39451    0.32575    0.28589    0.24154    0.19192    0.13603    0.072602
  11    .52538     .47696     .41902     .34845     .30707     .26064     .20818     .14842     .079747
  12    .54831     .50000     .44162     .36967     .32704     .27880     .22379     .16046     .086785
  13    .56912     .52110     .46254     .38956     .34589     .29610     .23880     .17217     .093716
  14    .58811     .54049     .48194     .40823     .36371     .31258     .25325     .18355     .10054

  15   0.60549    0.55838    0.50000    0.42579    0.38059    0.32832    0.26715    0.19463    0.10727
  16    .62147     .57492     .51684     .44234     .39660     .34335     .28055     .20541     .11390
  17    .63621     .59027     .53258     .45797     .41181     .35772     .29347     .21591     .12043
  18    .64984     .60456     .54733     .47274     .42626     .37147     .30593     .22613     .12686
  19    .66248     .61788     .56118     .48673     .44002     .38465     .31796     .23609     .13320

  20   0.67425    0.63033    0.57421    0.50000    0.45314    0.39729    0.32958    0.24580    0.13945
  21    .68522     .64200     .58649     .51260     .46566     .40942     .34082     .25527     .14561
  22    .69548     .65295     .59807     .52458     .47762     .42108     .35168     .26450     .15168
  23    .70509     .66325     .60903     .53599     .48905     .43228     .36219     .27350     .15767
  24    .71411     .67296     .61941     .54686     .50000     .44305     .37237     .28229     .16357

  25   0.72260    0.68213    0.62924    0.55723    0.51049    0.45343    0.38222    0.29086    0.16939
  26    .73060     .69079     .63859     .56714     .52054     .46342     .39177     .29924     .17513
  27    .73815     .69900     .64747     .57662     .53019     .47306     .40103     .30741     .18079
  28    .74529     .70678     .65593     .58569     .53946     .48236     .41001     .31540     .18638
  29    .75205     .71417     .66399     .59437     .54837     .49133     .41873     .32321     .19189

  30   0.75846    0.72120    0.67168    0.60271    0.55695    0.50000    0.42720    0.33084    0.19732
  40    .80808     .77621     .73285     .67042     .62763     .57280     .50000     .39866     .24791
  60    .86397     .83954     .80537     .75420     .71771     .66916     .60134     .50000     .33209
 120    .92740     .91321     .89273     .86055     .83643     .80268     .75209     .66791     .50000
   ∞   1.00000    1.00000    1.00000    1.00000    1.00000    1.00000    1.00000    1.00000    1.00000










For. v, = 0, x=0 















BETA DISTRIBUTION: 25 PER CENT POINTS FOR x

$\nu_1 = 2q$ (columns), $\nu_2 = 2p$ (rows)

$\nu_2$ | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9





1 | 0-14645 | 0-062500 | 0-039063 | 0-028309 | 0-022173 | 0-018215 | 0-015453 |0-013416 | 0-011853 1 
2 | -43750 -25000 *17452 *13397 -10870 -091440 | -078908 -069395, | -061929 2 
3 | -59715 -39685 *29801 *23885 +19937 -17113 -14991 -13339 -12015 | 3 
4 | -68878 -50000 -39448 -32635 *27852 +24302 *21560 -19376 -17596 + 


5 | 0-74711 0-57435 |0-46936 |0-39775 |0-34546 /|0-30550 |0-27390 | 0-24828 | 0-22707 5 
6 | -78726 -62996 -52848 -45632 -40198 -35944 -32516 -29692 *27323 6 
7 | -81650 67295 *57609 -50494 -45001 -40614 *37021 +34022 -31478 7 
8 | -83872 “70711 61516 -54582 . -49117 -44680 -40996 *37885 *35219 8 
9 | -85616 *73487 -64773 -58060 -62678 -48245 -44521 -41343 -38597 9 



































10 |0-87021 |0-75786 |0-67529 |0-61052 | 0-55783 |0-51390 |0-47662 |0-44451 |0-41655 | | 10 
11 | 88177 | -77720 | -69888 | -63651 | -68513 | -54184 | -50475 | -47257 | -44435 | | 11 
12 | -89144 | -79370 | -71931 | -65929 | -60930 | -56679 | -53009 | -49801 | -46970 | | 12 
13 | -89966 | -80793 | -73716 | -67941 | -63085 | -58921 | -55300 | -52116 | -49289 | | 13 
14 | -90672 | -82034 | -75288 | -69730 | -65017 | -60946 | -57382 | -54230 | -51419 14 
15 |0-91285 |0-83124 |0-76684 |0-71332 |0-66758 |0-62782 |0-59282 |0-56169 | 0-53380 15 
16 | -91823 | -84090 | -77932 | -72773 | -68336 | -64456 | -61021 | -57953 | -55192 | 16 
17 | -92298 | -84951 | -79053 | -74077 -| -69772 | -65986 | -62619 | -59599 | -56870 | 17 
18 | -92721 | -85724 | -80066 | -75263 | -71084 | -67391 | -64093 | -61122 | -58428 | 18 
19 | -93100 | -86422 | -s0986 | -76345 | -72287 | -68686 | -65456 | -62536 | -59879 19 
20 |0-93442 | 0-87055 |0-81825 |0-77337- |0-73395 | 0-69882 | 0-66720 |0-63852 |0-61234 | | 2¢ 
21 | -93751 | -87632 | -82593 | -78250 | -74418 | -70991 | -67895 | -65079 | -@2500 | | 21 
22 | -94033 | -88159 | -83299 | -79092 | -75366 | -72021 | -68991 | -66226 | -63688 | | 2% 
23 | -94290 | -88644 | -83950 | -79871 | -76246 | -72981 | -70015 | -67300 | -64803 | | 2: 
24 | -94526 | -89090 | -84553 | -80595 | -77066 | -73878 | -70973 | -68309 | -65852 | | 2 
25 |0-94744 | 0-89503 |0-85112 |0-81268 |0-77831 |0-74717 |0-71873 | 069258 |0-66840 | | 2 
26 | -94944 | -89885 | -85632 | -81896 | -78547 | -75505 | -72719 | -70151 | -67774 | | 2 
27 | -95130 | -90241 | -86116 | -82484 | -79218 | -76244 | -73515 | -70995 | -68657 | | 2’ 
28 | -95303 | -90572 | -86570 | -830356 | -79848 | -76941 | -74267 | -71793 | -e9492 | | 2 
29 | -95464 | -90882 | -86994 | -83552 | -80442 | -77598 | -74977 | -72548 | -70285 | 2 
30 |0-95614 |0-91172 |0-87393 | 0-84039 | 0-81002 [078219 |0-75649 | 0-73263 | 0-71038 | 3 
40} -96706 | -93303 | -90351 | -87685 | -85230 | -82947 | -80809 | -78797 | -76896 | 4 
60 | -97801 | -95484 | -93434 | -91548 | -89782 | -88113 | -86525 | -85009 | -83557 | 6 
120 | -98899 | -97716 | -96648 | -95647 | -94692 | -93774 | -92887 | -92025 | -91187 12 
co |1-00000 | 1-00000 | 1-00000 | 1-00000 |1-00000 |1-00000 /|1-00000 | 1-00000 | 1-00000 








This table gives the values of x for which $I_x(p, q) = 0.25$ where $p = \tfrac{1}{2}\nu_2$, $q = \tfrac{1}{2}\nu_1$.





























































BETA DISTRIBUTION: 25 PER CENT POINTS FOR x

$\nu_1 = 2q$ (columns), $\nu_2 = 2p$ (rows)

$\nu_2$ | 10 | 12 | 15 | 20 | 24 | 30 | 40 | 60 | 120
1 | 0-010616 0-0087814 | 0-0069734 0-0051914 | 0-0043101 | 0-0034353 | 0-0025669 | 0-0017049 | 0-0°84926 
2| -055913 | -046816 | -037631 | -028358 | -023689 | -018996 | -014281 | -0095436| -0047832 
3 | -10930 -092592 | -075324 | -057467 | -048307 | -038986 | -029500 | -019844 | -010012 
4 | -16116 -13797 | -11350 -087610 | -074095 | -060174 | -045827 | -031031 | -015764 
5 |0-20822 |0-18082 [0-15026 [011726 | 0-099749 |0-081498 | 0-062458 | 0-042571 | 0-021774 
6 | -25307 | -22058 -18500 | -14585 -12475 -10251- | -079043 | -054222 | -027923 
74 -29291 | -25724 -21758 -17316 -14888 -12301 095405 | -065857 | -034143 
8 | -32908 -29099 | -24802 -19913 -17203 -14289 “11145 -077403 | -040396 
9 | -36198 -32205 | -27644 -22376 -19420 -16211 -12712 | -088814 | -046656 
10 |0-39196 | 0-35068 | 0-30297 | 0-24710 | 0-21538 | 0-18064 | 0-14240 | 010006 0-052904 
11 | -41938 ‘37712 | -32776 «| -26921 -23562 | -19850 15726 | -11113 “059127 
12 | -44451 | 40158 | -35094 | -29017 | -25493 | -21570 ‘17171 +12200 065317 
13 | -46762 | -42426 | -37265 | -31004 | -27338 | -23226 | -18575 | -13267 071466 
14 | -48893 | 44534 | -39302 | -32889 | ‘29100 | -24819 -19938 | -14314 077570 
| 15 |0-50863 | 0-46497 jo-41215 |osssro |0-30785 |0-26353 | 0-21261 | 0-15341 | 0-083624 
| 16 | -52691 | -48330 | -43016 -36380 -32395 | -27831 22546 | -16347 -089625 
| 17 | -54389 | 50043 | -44712 -37998 -33936 | -29254 | -23793 | -17332 095571 
| 18 | -55972 51649 | -46312 -39539 35411 | -30625 | -25004 | -18298 -10146 
| 19 | -57449 | 53156 | -47825 -41008 “36824 | -31946 | -26179 | 19244 -10729 
pa | | 
| 20 |0-58832 |0-54574 |0-49256 |0-42409 | 0-38179 |0-33221 | 0.27321 |0-20170 | 0-11307 
21 | -60129 | -55909 | -50613 43746 | -39478 | -34450 | -28430 | -21078 -11878 
22 | -61348 | -57169 | -51900 | -45025 | -40726 | -35637 29507 | -21963 -12443 
| 23 | -62495 | -58360 | -53122 | -46247 -41925 | -36783 -30554 -22837 “13003 
| 24 | -63576 | -59487 | 54285 | -47418 -43078 | -37891 ‘31572 | -23690 -13556 
| 25 |0-64597 |0-60556 | 0-55392 wade 0-44186 |0-38961 |0-32561 | 0-24525 | 0-14103 
26 | -65563 | -61570 | -56447 | -49615 -45253 -39996 -33524 -25343 14645 
27 | -66478 | -62533 | -57455 -50647 -46281 -40997 -34460 -26145 -15180 
28 | -67346 | -63450 | -58417 -51639 “47272 -41967 -35371 -26931 -15710 
29 | -68170 | -64323 | -59337 -52591 -48228 -42906 -36259 -27701 -16234 
| | | 
| 30 |0-68954 | 0-65156 | 0-60217 |0-53508 | 0-49150 |0-43815 |0-37122 | 0-28455 | 0-16752 
| 40 | +75095 -71758 | -67308 -61054 -56853 -51555 -44650 35245 -21626 
| 60 | -82163 79529 | -75915 -70620 -66914 -62056 -55390 -45636 -29913 
| 120 | -90370 88794 | -86554 83103 -80557 “77041 -71852 -63381 -46918 
| co |1-00000 | 1-00000 pore 100000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 
| 





























For »,= 0, 7=0 













BETA DISTRIBUTION: 10 PER CENT POINTS FOR x

















$\nu_1 = 2q$ (columns), $\nu_2 = 2p$ (rows)

1 | 2 | 3 | 5 | 6 | 7 | 8 | 9

| 0-024472 |0-010000 | 0-0061812 0-0034818 | 0-0028553 | 0-0024193 | 0-0020986 | 0-0018523 
| -19000 -10000 -067830 -041268 -034511 -029654 -025996 -023141 
| +35136 *21544 -15648 -10154 -086434 -075257 -066647 -059809 
| -46812 *31623 *24136 -16493 -14256 -12558 *11224 -10147 
| 0-55185 0-39811 0-31529 0-22457 0-19664 0-17498 | 0-15766 0-14349 
*61375 -46416 -37816 *27858 *24664 -22139 =| -20091 *18394 
-66104 -51795 -43151 -32685 -29210 -26421 *24127 222067 

| *69821 -56234 -47700 -36982 -33319 *30339 -27860 *25764 
| -72814 -59948 -51610 -40811 -37029 “33915 *31299 *29067 
| 0-75273 0-63096 0-54996 0-44232 0-40382 0-37178 0-34462 0-32128 
| -77328 *65793 -57954 -47300 -43419 -40159 *37374 *34963 
| -79069 -68129 -60555 -50062 -45178 -42889 -40058 *37592 
| -80564 -70170 -62860 -52560 -48693 45393 -42535 -40032 
*81861 -71969 *64915 *54827 -50992 -47697 -44827 -42299 

| 0-82996 | 0-73564 | 0-66758 0-56893 |0-53100 |0-49822 |0-46951 | 0-44410 
| 83998 *74989 *68419 -58783 -55040 -51787 -48924 -46380 
84889 *76270 *69923 *60517 -56829 -563608 -50760 -48219 
*85686 *77426 *71293 *62114 -58484 -565300 -52473 -49942 

| -86403 *78476 *72544 *63588 -60020 -56876 *54074 *51557 
| 0-87052 0-79433 0-73691 | 0-64954 0-61448 0-58347 *55574 -53073 
*87643 *80309 “74747 *66222 *62779 -59722 -56980 *54500 
*88181 “81113 *75722 67403 -64022 61011 -58302 *55845 
| 788675 *81855 -36625 | *68504 -65187 *62222 *59547 “57115 
*89129 “82540 *77464 | *69535 *66279 63361 -60721 58314 
0-89549 |0-83176 | 0-78245 |0-70500 |0-67305 | 0-64434 61829 | 0-59450 
*89937 *83768 ‘78973 | -71407 -68271 65446 *62878 -60526 
-90297 “84319 *79655 | *72260 *69183 *66403 -63871 -61548 
-90633 “84834 *80294 73064 *70044 *67309 *64813 -62518 
-90946 *85317 80894 73823 -70858 68168 65708 63442 
0-91239 0:85770 0-81459 ( 0-74541 0-71630 0-68984 0-66559 0-64322 
93381 “89125 *85693 80025 *77578 *75319 “73219 *71255 
*95555 -92612 -90182 86048 *84212 *82490 “80864 *79321 
‘97761 -96235 *94944 92679 91643 *90653 “89702 *88785 
1-00000 | 1-00000 100000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 









































This table gives the values of x for which $I_x(p, q) = 0.10$ where $p = \tfrac{1}{2}\nu_2$, $q = \tfrac{1}{2}\nu_1$.





























































BETA DISTRIBUTION: 10 PER CENT POINTS FOR x

$\nu_1 = 2q$ (columns), $\nu_2 = 2p$ (rows)

$\nu_2$ | 10 | 12 | 15 | 20 | 24 | 30 | 40 | 60 | 120
1 | 00016585 | 0-0013709 | 0-0010878 | 0-0380919 | 0-0367157 | 0-0353506 | 0-0°39965 | 0-0326535 | 0-0313213 
2 | -020852 | -017407 | -013950 | -010481 | -0087416| -0069994| -0052542| -0035059| -0017545 
3 | -054245 | -045740 | -037035 | -028119 | -023579 | -018982 | -014327 | -0096132| -0048379 
4 | -092595 | -078823 | -064482 | -049452 | -041691 | -033749 | -025617 | -017288 | -0087521 
5 |0-13167 |0-11307 |0-093336 | 0-072324 |0-061295 | 0-049889 |0-038083 |0-025851 | 0-018167 
6 | -16964 | -14685 -12228 -095653 | -081477 | -066668 | -051174 | -034941 | -017906 
7 | -20573 | -17941 -15059 -11886 -10173 -083668 | -064573 | -044345 | -022866 
| § | -23966 | -21040 -17792 14161 -12177 -10064 -078083 | -053928 | -027978 
| g| -27139 | +23970 20411 -16374 -14141 -11743 -091577 | -063600 | -033196 
| 
| 10 |0-30097 | 0-26732 |0-22908 |0-18513 |0-16056 |0-13394 |0-10497 |0-073298 | 0038489 
| 1i | -32853 -29330 25284 -20576 -17915 -15010 -11820 -082977 | -043832 
12 | -35422 -31772 -27540 22559 -19716 -16587 13123 -092604 | -049206 
13 | -37817 -34068 -29682 24464 21457 -18124 14403 10215 054597 
14 | -40053 -36228 31715 26292 23139 -19619 15659 “11161 | 059993 
| 
15 |0-42143 |0-38261 |0-33645 | 0-28045 |0-24762 |0-21072 |0-16889 /|0-12096 | 0-065386 
16 | -44100 | -40176 -35478 -29726 26327 22483 -18093 13019 | -070768 
17 | -45934 | -41983 37219 31338 -27837 23853 -19270 -1393u | -076134 
18 | -47657 | -43689 | -38875 32885 -29293 25182 -20420 -14828 | -081478 
19 | -49277 | -45302 -40451 -34369 -30697 26471 21544 -15712 | -086796 
} 
20 | 0-50803 ae 0-41952 |0-35793 |0-32051 |0-27721 |0-22642 |0-16583 | 0-092085 
21 | -52243 -48276 | -43382 -37161 33358 28934 -23713 -17440 -097342 
22 | -53603 -49649 -44746 38475 ‘34619 | -30111 -24759 18283 10257 
23 | -54889 50953 -46049 | -39738 -35836 | -31253 -25781 19112 | -10775 
24 | -56108 | -52193 47294 40954 -37012 -32361 -26778 | -19928 | -11290 
| | | 
| 25 |0-57263 | 0-53373 | 0-48485 | 0-42123 | 0-38147 | 0-33437 | 0-27751 | 020730 0-11801 
| 26 | -58361 | -54498 -49624 | -43248 ‘39245 | -34481 | -28701 -21518 -12308 
| 27 | -59405 | -55571 50716 | -44333 40306 | -35495 | -29629 | -22293 12811 | 
| 28 | -60398 56595 -51763 45378 -41332 -36479 30534 | -23054 -13310 | 
29 | -61344 -B7574 52767 -46386 42325 -374386 | -31419 23803 -13804 
| | 
30 |0-62247 | 0-58511 |0-53731 |0-47359 | 0:43286 | 0-38366 | 0-32283 | 0-24539 | 0-14295 | 
40 | -69412 | -66034 -61599 -55476 51428 | -46386 -39910 -31243 -18960 | 
60 | -77851 ‘75104 71386 -66029 62333 | 57545 | -51067 -41750 27063 | 
120 | -87897 86198 83814 -80192 ‘77553 | -73946 | -68688 60235 -44158 
co |1-00000 | 1-:00000 |1-00000 |1-00000 |1-00000 /|1-00000 | 1-00000 |1-00000 | 1-00000 
| | | 























For vy, = 0, x=0 













BETA DISTRIBUTION: 5 PER CENT POINTS FOR x
























































$\nu_1 = 2q$ (columns), $\nu_2 = 2p$ (rows)

$\nu_2$ | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9
| 1 | 6-0061558 | 0-0025000 | 0-0015429 | 0-0011119 | 0-0386820 | 0-0°71179 | 0-0860300 | 0-0952300 | 0-0°46170 
2 | -097500 | -050000 | -033617 | -025321 | -020308 | -016952 | -014548 | -012741 | -011334 
| 3 | -22852 | -13572 | -097308 | -076010 | -062412 | -052962 | -046007 | -040671 | -036447 
| 4 | 34163 | -22361 | -16825 | -13535 | -11338 | -097611 | -085728 | -076440 | -068979 
5 |0-43074 |0-30171 | 0-23553 |0-19403 /|0-16528 |0-14408 |0-12778 |0-11482 | 0-10427 
6 | -50053 | -36840 | -29599 | -24860 | -21477 | -18926 | -16927 | -15316 | -13989 
| 7 | +55593 | -42489 | 34929 | -29811 | -26063 | -23182 | -20890 | -19019 | -17461 
| 8 | -60071 | -47287 | -39607 | -34259 -| -30260 | -27134 | -24613 | -22532 | -20783 
9 | 63751 | -51390 | -43716 | -38245 | -34080 | -30777 | -28082 | -25835 | -23930 
| 
| | | | 
10 |0-66824 /0-54928 | 0-47338 | 0-41820 | 0-37553 | 0-34126 |0-31301 | o.28924 | 0-26894 
11 | -69425 | ‘58003 | -50546 =| -45033 | -40712 | -37203 | -34283 | -31807 | -29677 
12 | -71654 | -60696 | -53402 | -47930 | -43590 | -40031 | -37044 | -34494 | -32286 
13 | -73583 | -63073 | -55958 | -50551 | -46219 | -42635 | -39604 | -37000 | -34732 
14 | -75268 | -65184 | -58256 | -52932 | -48626 | -45036 | -41980 | -39338 | -37025 
| | | | 
| 15 |0-76754 |0-67070 | 0-60333 |0-55102 |0-50836 |0-47255 |0-44187 |0-41521 | 0-39176 
16 | -78072 | -68766 | -62217 | -57086 | -52872 | -49310 | -46242 | -43563 | -41196 
17 | -79249 | -70297 | -63933 | -58907 | -54750 | -51217 | -48158 | -45474 | -43094 
18 | -80307 | -71687 | -65503 | -60584 | -56490 | -52991 -49949 | -47267 | -44880 
19 | -81263 | -72954 | -66944 | -62131 | -58103 | -54645 | -51624 | -48951 | -46564 
20 |0-82131 |0-74113 |0-68271 |0-63564 | 0-59605 | 056189 053194 | 0-50535 | 0-48152 
21 | -82923 | -75178 | -69496 | -64894 | -61004 | -57635 | -54669 | -52027 | -49652 
| 22 | -83647 | -76160 | -70632 | -66132 | -62312 | -58990 | -56056 | -53434 | -51071 
| 23 | -84313 | -77067 | -71687 | -67287 | -63536 | -60263 | -57363 | 54764 | -52415 
| 24 | -84927 -| -77908 | -72669 | -68366 | -64684 | -61461 | -58596 | -56022 | -53689 
| 25 |0-85494 |0-78690 |0-73586 |0-69377 |0-65764 |0-62590 |0-59761 |0-57213 | 0-54898 
| 26 | -86021 ‘79418 | -74444 | -70327 | -66780. | -63656 | -60864 | -58343 | -56048 
| 27 | -86511 | -80099 ‘75249 ‘71219 | -67738 =| -64663 | -61909 | -59416 ‘57141 
| 28 | -86967 | -80736 | -76004 | -72060 | -68643 | -65617 | -62900 | -60436 | -58183 
| 29 | -87394 | -81334 | -76715 | -72854 | 69499 | -66522 | -63842 | -61407 | -59177 
| | | | | | 
| 30 |0-87794 | 0-81896 | 0-77386 | 0-73604 | 0-70311 |0-67381 |0-64738 |0-62332 | 0-60125 
| 40| -90734 | -seos9 | -82447 | -79327 | -76559 | -74053 | -71758 | -69636 | -67663 
| 60 | -93748 | -90497 | -87881 | -85591 | -83517 | -81606 | -79824 | -78150 | -76569 
| 120 | 96837 | -95130 | -93720 | -92458 | -91290 | -g0192 | -89148 | -88150 | -87191 
| 2 |1-00000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 |1-00000 | 1-00000 
| | | | | | | | 










This table gives the values of x for which $I_x(p, q) = 0.05$ where $p = \tfrac{1}{2}\nu_2$, $q = \tfrac{1}{2}\nu_1$.







BETA DISTRIBUTION: 5 PER CENT POINTS FOR x

$\nu_1 = 2q$ (columns), $\nu_2 = 2p$ (rows)

$\nu_2$ | 10 | 12 | 15 | 20 | 24 | 30 | 40 | 60 | 120
1 | 0-0°41325 | 0-0934155 | 0-0°27098 | 0-0°20156 | 0-0316727 | 0-0°13326 | 0-0*99535 | 0-0*66082 | 0-0'32904 
2 | -010206 | -0065124| -0068158| -0051162| -0042653| -0034137| -0025614| -0017083| -0°85452 
3 | -033020 | ‘027794 | -022465 | -017026 | -014264 | -011472 | -0086511| -0057991| -0029157 
4 | -062850 | -053375 | -043541 | -033319 | -028053 | -022679 | -017191 | -011585 | -0058567 
| | 

5 |0-095510 |0-081790 | 0-067312 | 0-051995 | 0-043994 | 0-035747 | 0-027240 | 0-018458 | 0-0093841 

6 | -12876 ‘11111 -092207 | -071870 | -061103 | -049898 | -038224 | -026043 | -013317 

7 | -16142 -14029 11733 -092238 | -078783 | -064651 | -049781 | -034103 | -017540 
8 | -19290 -16875 -14216 -11267 096658 | -079695 | -061675 | -042481 | -021976 | 
9 | -22292 -19618 -16638 | -13288 -11449 -094827 | -073748 | -051068 | -026572 | 

10 |0-25137 |0-22244 |0-18984 |0-15272 |0-13211 |0-10991 | 0-085885 |0-059785 | 0-031288 

ll | -27823 -24746 21244 17207 -14943 -12484 -098008 | -068575 | -036094 

12 | -30354 27125 -23413 19086 | 16636 | -13955 -11006 -077394 | -040967 

13 | -32737 -29383 -25492 -20908 “18288 | -15401 -12199 -086209 | -045889 

14 | -34981 31524 -27481 -22669 -19895 | “16818 | -13377 094994 | -050847 

| 

15 |0-37095 |0-33554 |0-29382 /|0-24870 |0-21457 |0-18203 | 0-14539 |0-10373 | 0-055827 

16 | -39086 35480 31199 -26011 22972 | -19556 -15682 | -11240 -060821 
17 | -40965 37307 -32936 27594 ‘24441 | -20877 -16805 | -12099 -065820 | 

18 | -42738 -39041 34596 -29120 ‘25865 | -22164 -17908 -12950 -070818 

19 | -44414 -40689 -36183 ‘30591 | -27244 | -23418 18989 -13791 -075809 

20 |0-45999 | 0-42256 |0-37701 | 0-32009 | 0-28580 |0-24639 |0-20050 /|0-14622 | 0-080789 

21 | -47501 -43746 39154 -33375 | -29874 | -25828 -21088 -15442 | -085753 

22 | -48925 -45165 -40544 34693 -31126 | -26985 -22106 | -16252 | -090698 
23 | -50276 -46518 -41877 -35964 | -32340 | -28112 -23102 | -17051 | -095621 | 
24 | -51560 -47805 43154 37190 33515 | -29208 -24078 | -17838 10052 | 
| 
25 |0-52782 |0-49040 |0-44379 |0-38373 | 0-34653 | 0-30275 | 0-25032 |0-18615 | 0-10539 | 
26 | -53945 -50217 45554 -39516 | -35756 -31314 -25966 | -19379 | -11024 
27 | -55054 51343 -46683 -40619 | -36826 | -32325 -26880 | -20133 11505 | 
28 | -56112 -52420 -47768 -41685 | -37862 | -33309 27775 | -20875 11983 | 
29 | -57122 53452 -48812 | 42715 | -38867 | -34267 -28650 -21606 12458 | 
30 |0-58088 | 0-54442 | 0-49816 |0-43711 |0-39842 |0-35200 |0-29507 |0-22326 | 0-12930 
40 | -65819 | -62460 | -58083 52099 | -48175 -43321 -37136 | -28936 “17453 | 
60 | -75070 | -72282 | -68535 63185 | 59522 54807 -48477 “39458 | -25416 | 
120 | -86266 | 84504 -82047 -78342 | -75661 | -72016 | -66738 -58326 42519 | 
co | 1-00000 porn 1-00000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 /1-00000 | 








For »,= 0, r=0 







BETA DISTRIBUTION: 2.5 PER CENT POINTS FOR x





























$\nu_1 = 2q$ (columns), $\nu_2 = 2p$ (rows)

$\nu_2$ | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9
1 | 0-0015413 | 0-0962500 | 0-0338558 | 0-0°27783 | 0-0321691 | 0-0317782 | 0-0915064 | 0-0°13065 | 0-0311533 
2| -049375 | -025006 | -016737 | -012579 | -010076 | -0084038| -0072076| -0063095| -0056104 
3 | -14675 | -085499 | -060830 | -047316 | -038748 | -032820 | -028471 | -025143 | -022513 
4| -24664 | -15811 | -11786 | -094299 | -078706 | -067586 | -059243 | -052745 | -047539 
5 |0-33318 |0-22865 |0-17674 |0-14471 |0-12275 |0-10669 |0-094390 |0-084663 |0-076770 
6 | -40505 | -29240 | -23259 | -19412 | -16695 | -14663 | -13081 | -11812 | -10770 
7 | -46442 | -34855 | -28375 | -24063 | -20942 | -18562 | -16681 | -15153 | -13886 
8 | -51378 | -39764 | -32993 | -28358 4 -24933 | -22278 | -20151 | -18405 | -16944 
9 | -55524 | -44054 | -37137 | -32290 | -28642 | -25774 | -23450 | -21523 | -19897 
10 |0-59043 |0-47818 |0-40855 |0-35877 |0-32071 |0-29042 |0-26561 |0-24486 | 0-22722 
11 | -62062 | -51135 | -44194 | -39146 | -35234 | -32085 | -29482 | -27288 | -25409 
12 -64677 -§4074 *47202 *42128 -38149 34914 -32219 -29930 27957 
13 | -66961 | -56693 | -49920 | -44853 | -40838 | -37545 | -34779 | -32416 | -30368 
14 | -68973 | -59038 | -52385 | -47349 | -43321 | -39991 | -37175 | -34755 | -32646 
15 |0-70756 |0-61149 |0-54628 |0-49641 |0-45618 |0-42268 |0-39418 |0-36955 | 0-34799 
16 | -72349 | -63058 | -56676 | -51750 | -47746 | -44390 | -41520 | -39026 | -36833 
17 | -73778 | -64792 | -58553 | -53697 | -49723 | -46372 | -43490 | -40976 | -38756 
18 -75069 -66373 -60278 55498 “51561 48224 *45341 -42814 40575 
19 | -76239 | -67821 | -61869 | -57169 | -53276 | -49959 | -47081 | -44549 | -42297 
20 |0-77305 |0-69150 }0-63339 |0-58722 |0-54877 |0-51586 |0-48719 |0-46187 | 0-43928 
21 | -78280 | -70376 | -64702 | -60169 | -56375 | -53115 | -50263 | -47736 | -45475 
22 | -79176 | -71509 | -65970 | -61520 | -57780 | -54553 | -51720 | -49202 | -46943 
23 | -80001 | -72559 | -67150 | -62785 | -59100 | -55908 | -53098 | -50592 | -48338 
24 80763 *73535 -68253 -63970 -60341 57187 -54401 -51911 -49664 
25 |0-81469 |0-74445 |0-69285 |0-65084 |0-61511 |0-58396 |0:55636 |0-53163 | 0-50927 
26 | -82126 | -75295 | -70253 | -66132 | -62615 | -59540 | -56808 | -54354 | -52130 
7 | -82738 | -76090 | -71162 | -67119 | -63658 | -60624 | -57922 | -55488 | -53278 
28 | -83310 | -76836 | -72018 | -68052 | -64646 | -61652 | -58980 | -56568 | -54373 
29 | 83845 | -77538 | -72825 | -68933 | -65582 | -62630 | -59988 | -57599 | -55420 
30 |0-84347 |0-78198 |0-73587 |0-69768 |0-66471 |0-63559 |0-60948 |0-58582 | 0-56421 
40 | -88059 ‘83157 -79381 -76184 -73369 -70839 -68532 -66411 -64446 
60 | -91904 | -88430 | -85681 -83298 | -81156 | -79193 | -77372 | -75668 | -74065 
120 | -95883 -94037 *92535 -91201 *89975 -88828 87743 -86708 *85717 
oo | 1-00000 1-00000 |1-00000 |1-00000 |1-00000 |1-00000 |1-00000 /|1-00000 | 1-00000 























This table gives the values of x for which $I_x(p, q) = 0.025$ where $p = \tfrac{1}{2}\nu_2$, $q = \tfrac{1}{2}\nu_1$.



















BETA DISTRIBUTION: 2.5 PER CENT POINTS FOR x

$\nu_1 = 2q$ (columns), $\nu_2 = 2p$ (rows)

$\nu_2$ | 10 | 12 | 15 | 20 | 24 | 30 | 40 | 60 | 120
1 | 0-0°10323 | 0-0485313 | 0-067686 | 0-0450345 | 0-0441780 | 0-0433285 | 0-0424860 | 0-0416505 | 0-0582180 
2} -0050508| -0042107| -0033700| -0025286| -0021076| -0016864| -0012651| -0°84357 | -0°42187 
3 | -020382 | -017139 | -013838 | -010477 | -0087725| -0070519| -0053148!| -0035607| -0017893 
4 | -043272 | -036693 | -029885 | -022831 | -019207 | -015514 | -011749 | -0079110| -0039956 
5 | 0-070233 | 0-060028 | 0-049302 |0-038002 |0-032119 | 0-026068 | 0-019841 | 0-013428 | 0-0068184 
6 | -098988 | -085233 | -070563 | -054861 | -046579 | -037985 | -029056 | -019767 | -010092 
7 | -12818 -11113 092695 | -072663 | -061969 | -050772 | -039029 | -026691 | -013702 
8 | -15701 -13700 -11508 -090920 | -077871 | -064092 | -049508 | -034033 | -017569 
9 | -18504 -16240 -13732 -10931 -094004 | -077712 | -060314 | -041675 | -021634 
10 |0-21201 |0-18709 | 0-15917  |0-12760 |0-11017 | 0-091466 |0-071319 |0-049528 | 0-025854 
11 | -23780 -21091 -18048 14565 -12623 -10523 -082426 | -057522 | -030196 
12 | -26238 -23379 20115 -16336 -14210 -11893 -093564 | -065622 | -034634 
13 | -28573 “25571 -22112 -18067 15770 -13249 10468 -073771 | -039147 
14 | -30790 -27667 -24039 -19753 -17299 -14588 -11573 081944 | -043718 
15 |0-32893 | 0-29668 /|0-25893 |0-21392 |0-18793 |0-15905 |0-12669 | 0-090115 | 0-048335 
16 | -34888 -31578 -27676 -22983 -20252 -17198 -13753 -098266 | -052985 
17 | -36779 -33400 -29389 24525 -21674 -18466 -14823 -10638 -057659 
18 | -38574 -35138 -31034 -26019 -23058 -19708 -15878 11444 -062348 
19 | -40278 -36797 -32614 -27465 24404 -20922 -16916 -12244 -067047 
20 |0-41896 |0-38380 |0-34132 |0-28864 |0-25713 |0-22110 |0-17938 |0-13038 | 0-071749 
21 | -43435 -39893 35589 -30218 -26985 -23270 -18943 -13823 -076450 
22 | -44900 -41338 -36990 -31528 -28221 -24402 -19930 -14601 081144 
23 | -46294 -42720 -38835 -32795 -29422 -25508 -20899 -15370 -085828 
24 | -47623 -44042 -39629 -34021 -30588 -26587 -21850 -16130 -090500 
25 |0-48891 |0-45307 |0-40874 |0-35207 |0-31721 |0-27640 |0-22783 |0-16881 | 0-095156 
26 | -50101 -46520 -42071 -36355 -32821 -28667 -23698 -17622 -099794 
27. | -51257 -47682 | -43223 -37466 -33890 -29669 -24596 -18354 -10441 
| 28 | -52363 -48797 -44334 38542 -34928 -30647 -25476 -19076 -10901 
| 29] -53421 -49867 -45403 39584 -35937 -31601 -26339 -19789 -11358 
| 3010-54435 |0-50895 |0-46434 |0-40594 |0-36918 |0-32532- |0-27185 |0-20492 |0-11812 
40 | -62616 -59296 -54999 -49168 -45370 -40697 ‘34780 | -26997 -16201 
60 | -72550 -69743 65992 -60674 -57056 -52422 -46239 | -37498 -24027 
| 120 | -84764 82954 -80442 -76678 -73968 -70299 -65017 -56658 -41107 
& 100000 | 1-00000 | 1-00000 | 1-00000 /|1-00000 |1-00000 | 1-00000 | 1-00000 | 1-00000 





























For »,= 0, z=0 











178 


Percentage points of the incomplete beta-function 


Beta DISTRIBUTION: 1 PER CENT POINTS FOR x 






































ν₁ = 2q, ν₂ = 2p
ν₂ \ ν₁ | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9
1 | 0-0324672 | 0-0°10000 | 0-0461686 | 0-0444446 | 0-0434699 | 0-0428446 | 0-0424097 | 0-0420897 | 0-0*18449 
2 | -019900 | -010000 | -0066778| -0050126| -0040121| -0033445| -0028674| -0025094| -0022309 
3 | -080827 | -046416 | -032834 | -025458 | -020807 | -017599 | -015252 | -013458 | -012043 
4 | -15875 | -10000 | -073960 | -058903 | -049014 | -041999 | -036754 | -032682 | -029426 
a 5 |0-23520 | 0-15849 | 0-12142 | 0-098877 | 0-083563 | 0-072429 | 0-063948 | 0-057264 | 0-051857 
6 | -30387 | -21544 | -16979 | -14087 | -12065 | -10564 | -094014 | -084730 | -077136 
7 | -36370 | -26827 | -21636 | -18236 | -15801 | -13959 | -12511 | -11341 | -10375 
8 | -41540 | -31623 | -25997 | -22207 -| -19437 | -17307 | -15612 | -14227 | -13073 
9 | -46009 | -35938 | -30024 | -25945 | -22910 | -20543 | -18637 | -17066 | -15745 
10 |0-49889 | 0-39811 /|0-33719 |0-29431 |0-26191 |0-23632 |0-21551 |0-19820 | 0-18355 
11 | -53279 | -43288 | -37099 | -32667 | -29271 | -26560 | -24335 | -22469 | -20879 
12 | -56258 | -46416 | -40191 -35664 | -32153 | -29323 | -26981 | -25003 | -23307 
13 | -58893 | -49239 | -43020 | -38437 | -34845 | -31924 | -29487 | -27417 | -25631 
14 | -61238 | -51795 | -45615 | -41006 | -37358 | -34369 | -31858 | -29712 | -27851 
15 | 0-63336 | 0-54117 |0-47999 |0-43387 |0-39706 |0-36666 |0-34098 |0-31891 | 0-29968 
16 | -65224 | -56234 50194 | -45597 | -41899 | -38826 | -36214 | -33958 | -31985 
17 | -66930 | -58171 52219 | -47651 -43951 | -40857 | -38213 35920 | -33905 
| 18 | 68479 | -59948 | -54094 | -49565 | 45872 | -42768 | -40103 37781 -35733 
i9 | -69892 | -61585 55832 | -51350 | -47674 | -44568 | -41890 | -39547 | -87474 
| 
| j 
20 |0-71185 |0-63096 |0-57447 | 0-53018 10-4936 |0-46266 | 0-43581 |0-41224 | 0-39131 
21 | -72372 | -64495 | -58952 | -54581 | -50958 | -47868 | -45184 | -42818 | -40711 
22 | -73467 | -65793 | -60357 | -56046 | -52456 | -49383 | -46703 | -44333 | -42217 
23 | -74479 | -67002 | -61671 | -57422 | -53869 | -50816 | -48144 | -45775 | -43653 
24 | 75417 | -68129 | -62903 | -58717 | -55204 | -52174 | -49514 | -47149 | -45025 
| | | | 
25 |0-76290 |0-69183 | 0-64059 |0-59938 | 0-56466 |0-53461 |0-50816 |0-48458 | 0-46335 
| 26] -77103 | -70170 | -65147 | -61090 | -57660 | -54683 | -52055 | -49706 | -47587 
| 27] -77862 | -71097 | -66172 | -62180 | -58793 | -55845 | -53236 | -50899 | -48785 
| 28 | -78573 | -71969 | -67139 | -63211 | -59868 | -56951 | -54362 | -52038 | 49932 
| 29 | -79240 | ‘72790 | -68054 | -64188 | “60890 | -58004 | -55437 | -53127 | -51031 
| 30 | 0-79867 pousees 0-68919 |0-65116 |0-61862 |0-59008 | 0-56464 |[0-54170 | 0-52085 
40 | -84541 | -79433 | -75561 | -72316 | -69482 | -66950 | -64656 | -62555 | -60617 
60 | -89449 | ‘85770 | -82898 | -80433 | -78233 ‘76227. | -74376 | -72651 | -71034 
120 | -94599 | -92612 | -91014 | -89607 | -88321 ‘87124 | -85995 | ‘84924 | -83900 
20 |1-00000 |1-00000 |1-00000 |1-00000 |1-00000 |1-00000 | 1-00000 | 1-00000 | 1-00000 
| | | 








This table gives the values of x for which I_x(p, q) = 0.01, where p = ½ν₂, q = ½ν₁.
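As a quick check on this definition, the column ν₁ = 2 (that is, q = 1) can be verified in closed form, since I_x(p, 1) = x^p and the percentage point is therefore x = P^(2/ν₂). The following Python sketch is illustrative only; the function name is not part of the original paper.

```python
def beta_point_q1(P, nu2):
    """Solve I_x(p, 1) = x**p = P for x, where p = nu2 / 2 and q = 1."""
    return P ** (2.0 / nu2)

# 1 per cent points in the column nu1 = 2 for a few row values of nu2
for nu2 in (2, 3, 4, 6, 12):
    print(nu2, round(beta_point_q1(0.01, nu2), 5))
```

The printed values can be compared with the entries of the ν₁ = 2 column for the same ν₂.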















































































CATHERINE M. THOMPSON 179 
Beta DISTRIBUTION: 1 PER CENT POINTS FOR x
ν₁ = 2q, ν₂ = 2p
ν₂ \ ν₁ | 10 | 12 | 15 | 20 | 24 | 30 | 40 | 60 | 120
1 | 0-0416513 | 0-0*13647 | 0-0*10827 | 0-0580531 | 0-0566831 | 0-0553242 | 0-0539766 | 0-0526400 | 0-0513145 | 
2 | -0020080| -0016737| -0013391| -0010045| -0°83718 | -0°66980 | -0°50239 | -0333496 | -0°16749 
3 | -010898 | -0091569| -0073877| -0055887| -0046777| -0037588| -0028317| -0018964| -0°95252 
| 4 | -026763 | -022665 | 018435 | -014065 | -011824 | 0095436 | -0072226 | -0048595| -0024525 
| | | 
| | 
| 5 |0-047389 | 0-040434 | 0-033149 | 0-025503 | 0-021534 | 0-017459 | 0-013275 | 0-0089747 | 0-0045520 
| 6 | -070804 | -060840 | -050258 | -038982 | -033057 | -026923 | -020567 | -013973 | -0071235 
7 | -095627 | -082714 | -068820 | -053801 | -045816 | -037481 | -028767 | -019640 | -010065 
8 | -12095 -10526 | -088177 | -069456 | -059390 | -048797 | -037625 | -025815 | -013300 
9 | -14619 -12796 | -10787 085584 | -073472 | -060623 | -046956 | -032376 | -016768 
| 
10 |0-17097 |0-15044 /|0-12760 |0-10193 | 0-087838 |0-072776 | 0-056621 |0-039229 |0-020426 
| 11 | -19506 -17250 -14713 -11830 -10232 ‘085117 | -066512 | -046303 | -024237 
| 12 | -21834 -19398 -16633 "13458 -11681 097542 | -076547 | -053541 | -028173 
| 13 | -24073 -21479 “18511 “15065 -13120 -10997 -086660 | -060897 | -032212 
14 | -26220 | -23489 -20338 -16646 14544 -12235 -096802 | -068334 | -036335 
| | | 
| 15 |0-28276 |0-25426 |0-22113 |0-18196 |0-15948 |0-13462 | 0-10693 | 0-075824 0-040526 | 
| 16 | -30240 ‘27289 | -23833 | -19711 | -17327 | -14676 -11702 | -083341 | -044772 | 
17 | -32117 “29079 25497 | -21189 | -18681 | -15873 -12704 -090866 | -049062 | 
18 | -33910 30797 | -27105 ‘22630 | -20005 | -17053 | -13697 098383 | -053386 | 
19 | -35622 | -32446 | -28658 “24032 | -21301 | -18212 | -14680 "10588 | -057738 
j | | | | | | 
20 |0-37257 | 0-34029 | 0-30157 | 0-25395 | 0-22567 /0-19351 | 0-15651 | 0-11334 | 0-062109 | 
21 | -38818 | -35548 -31603 -26721 -23803 20468 | -16609 -12076 | -066494 
22 | -40311 -37005 -32999 -28008 -25008 21563 | -17554 -12812 | -070888 | 
23 | -41738 -38405 34345 -29258 -26184 -22636 -18486 “13543 -075285 
24 | -43103 | -39749 | -35645 -30472 -27329 ‘23687 -19403 14268 | -079683 
| | | | 
25 |0-44410 | 0-41040 | 0-36899 |0-31651 | 0-28446 |0-24716 | 0-20305 (| 0-14986 | 0-084077 
26 | -45661 | -42280 | -38109 -32795 -29534 -25722 ‘21193 -15697 -088465 
27 | -46861 | -42473 -39278 -33906 -30594 -26707 -22066 -16401 -092843 
28 | -48011 44621 -40407 34985 -31626 -27670 -22925 -17096 097210 
29 | -49115 45726 =| -41497 | -36032 -32632 -28612 -23768 -17785 “10156 
| | | 
30 |0-50175 |0-46789 | 0-42552 | 0-37049 | 0-33612 | 0-29534 [0-24597 | 0-18465 | 0-10590 
40 | -58819 | -55573 *51398 | -45778 -42144 -37700 “32111 -24819 “14811 
60 | -69511 -66701 -62969 57717 54167 -49647 “43655 -35258 *22459 
| 120 | -82918 | -81062 -78497 74677 -71942 -68259 62988 | -54709 -39479 
| 2 |1-00000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 | 1-00000 =| 1-00000 | 
| 1 | Ee | _| 





For v,= 0, x=0 











180 


Percentage points of the incomplete beta-function 


Beta DISTRIBUTION: 0.5 PER CENT POINTS FOR x











ν₁ = 2q, ν₂ = 2p
ν₂ \ ν₁ | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9
| 1 | 0-0461684 | 0-0425000 | 0-0415421 | 0-0411111 | 0-0586745 | 0-0571112 | 0-0560240 | 0-0552245 | 0-0546121 | 
2 | -0099750| -0050000| -0033361| -0025031| -0020030| -0016695| -0014311| -0012524| -0011133| 
3 | -051237 | -029240 | -020632 | -015976 | -013046 | -011028 | -0095530| -0084269| -0075388 
| 4| -11321 | -070711 | -052099 | -041400 | -034399 | -029445 | -025748 | -022881 | -020592 
| 5 |0-17996 |0-12011 |0-091593 |0-074378 |0-062737 |0-054301 |0-047891 |0-042849 | 0-038776 
| 6| -24356 | -17100 | -13408 | -11088 | -094759 | -0s2829 | -073619 | -066279 | -060287 
7 -30126 *22007 -17656 -14830 -12818 -11303 -10116 -091593 -083707 
8 -35261 *26591 °21745 *18510 ~ -16159 -14360 -12933 -11770 -10804 
9 | -39799 | -30808 | -25604 | -22046 | -19415 | -17373 | -15736 | -14389 | -13261 
10 |0-43809 {0-34657 |0-29204 |0-25399 |0-22542 |0-20297 |0-18478 |0-16970 |0-15697 
11 | -47360 | -38162 | -32543 | -28554 | -25517 | -23105 | -21132 | -19484 | -18083 
12 | -50517 | -41352 | -35632 | -31509 | -28332 | -25783 | -23682 | -21914 | -20402 
13 | -53337 | -44258 | -38487 | -34270 | -30986 | -28328 | -26120 | -24950 | -20642 
14 | -55865 | -46912 | -41127 | -36848 | -33484 | -30739 | -28444 | -26489 | -24798 
15 |0-58144 |0-49340 |0-43569 |0-39255 |0-35833 |0-33022 |0-30656 |0-28629 | 0-26869 
16 -60206 *51567 -45832 -41503 -38042 -35180 *32757 -30672 -28852 
17 -62080 -53616 -47932 -43605 “-40120 -37221 -34754 -32620 *30752 
18 -63789 *565505 -49885 *45571 -42076 “39151 -36650 -34478 -32568 
19 | -65354 | -57251 | -51703 | -47414 | -43917 | -40976 | -38451 | -36248 | -34305 
20 |0-66792 |0-58870 |0-53400 |0-49144 |0-45654 |0-42705 |0-40162 |0-37936 |0-35966 
21 -68117 -60375 -54986 -50768 -47293 -44343 -41789 -39546 +37554 
22 | -69341 | -61775 | -56472 | -52297 | -48841 | -45896 | -43336 | -41082 | -39073 
23 -70477 -63083 -57866 -53738 -50306 -47369 -44810 *42547 *40527 
24 | -71532 | -64305 | -59176 | -55098 | -51692 | -48769 | -46213 | -43947 | -41918 
25 | 0°72516 0-65451 0-60409 0-56382 0-53007 0-50100 0-47551 0-45285 0-43251 
26 *73434 *66527 -61571 -67597 -54255 -51367 *48827 -46564 +44528 
7 *74294 -67539 -62669 -58749 -55441 *62574 -§60046 -47788 -45752 
28 *75100 -68492 -63707 -59841 -56568 -53724 -51210 -48960 *46927 
29 *75857 -69392 -64690 -60878 -57642 *54822 *§2324 -50083 -48054 
30 | 0-76570 0-70242 0-65622 0-61864 0-58665 0-55871 0-53389 0-51159 0-49137 
40 *81920 *76727 -72823 -69571 -66744 *64229 -61956 -59882 -57973 
60 *87598 ‘83811 *80877 -78370 -76142 -74119 -72256 *70526 -68907 
120 -93619 -91548 -89893 *88442 *87120 *85892 *84739 *83645 *82602 
co |1-00000 | 1-00000 |1-00000 |1-00000 |1-00000 |1-00000 |1-00000 |1-00000 | 1-00000 









































This table gives the values of x for which I_x(p, q) = 0.005, where p = ½ν₂, q = ½ν₁.























CATHERINE M. THOMPSON 181 
Beta DISTRIBUTION: 0.5 PER CENT POINTS FOR x
ν₁ = 2q, ν₂ = 2p
ν₂ \ ν₁ | 10 | 12 | 15 | 20 | 24 | 30 | 40 | 60 | 120
1 | 0-0841280 0-0534116 | 0-0527067 | 0-0520132 | 0-0516707 |0-0513310 | 0-0°99411 | 0-0°65998 | 0-0832862 
| 2 | -0010020| -0°83507 | -0°66812 | -0°50113 | -0°41762 | -0933411 | -0°25060 | -0°16707| -0483539 
| 3 -0068204| -0057290| -0046206] -0034943| -0029242] -0023493] -0017696| -0011848]| -0359503 
| 4 | -018721 | -015844 | -012879 | -0098197]| -0082522] -0066584] -0050373| -0033880]| -0017093 
| 
| 5 | 0-035415 0-030191 |0-024729 |0-019006 |0-016040 |0-012998 | 0-0098777 | 0-0066743 | 0-0033833 
6 | -055299 | -047464 | -039162 | -030337 | -025709 | -020924 | -015973 | -010844 | -0055240 
| 7 | -077090 | -066593 | -055329 | -043189 | -036749 | -030038 | -023034 | -015712 | -0080441 
8 | -099867 | -086787 | -072586. | -057076 | -048760 | -040023 | -030828 | -021129 | -010873 
9 | -12300 -10749 090464 | -071635 | -061433 | -050635 | -039174 | -026976 | -013953 
| 10 |0-14606 ca 0-10862 |0-086595 |0-074540 |0-061684 |0-047930 |0-033162 |0-017241 
11 | -16876 | -14898 -12683 10175 -087903 | -073027 | -056985 | -039611 | -020700 
12 | -19092 | -16931 -14489 -11696 -10139 -084550 | -066252 | -046265 | -024302 
13 | -21242 | -18919 -16270 -13210 -11490 -096157 | -075662 | -053076 | -028022 
| 14] -23320 -20853 -18017 14710 12835 -10781 -085158 | -060005 | -031841 
| 
15 |0-25323 |0-22728 |0-19725 |0-16190 |0-14170 |0-11942 |0-094697 |0-067019 | 0-035743 
| 16 | -27248 24543 -21388 -17644 -15488 -13097 10424 -074093 | -039714 
| 17] -29098 | *26295 -23006 -19070 -16787 14241 11377 -081204 | -043741 
| 18 | -30872 | -27986 24576 20465 -18065 -15373 -12324 088334 | -047815 
| 19 | -32574 29615 -26099 | -21829 -19319 -16489 13265 -09546" | -051928 
| 20 |0-34206 | 0-31184 | 0-27575 | 0-23160 |0-20549 |0-17590 | 0-14198 |0-10259 | 0-056070 
| 21 | -35770 ‘32696 | -29004 | -24458 21753 -18673 15122 -10969 -060237 
| 99 72€9 ‘34151 | -30387 | -25723 -22932 -19738 -16036 -11676 064421 
| 23] -38707 | 35552 | -31725 | -26954 24084 -20784 -16938 -12380 -068619 
| 24 | -40087 -36901 | -33020 -28153 25210 -21811 -17829 -13078 -072825 
| 25 |0-41411 |0-38200 |0-34272 |0-29320 |0-26309 |0-22818 /|0-18707 |0-13772 |0-077036 | 
26 | -42682 39452 | -35484 -30456 -27383 -23806 -19573 14461 -081248 
27 | -43903 40658 | -36657 -31560 28432 24775 20426 -15143 -085459 
28 | -45076 -41821 | -37792 -32635 29455 25724 21266 -15819 089665 
29 | -46204 42942 -38891 -33681 -30454 -26654 -22093 16489 093863 
30 |0-47289 |0-44024 |0-39954 |0-34698 |0-31429 |0-27565 |0-22907 |0-17152 |0-098050 
40 | -56205 53024 -48950 43493 -39980 -35700 -30341 -23388 -13907 
60 | -67384 64584 -60879 55688 -52194 -47762 -41913 33759 21421 
120 | -81604 79720 77125 73277 -70531 -66845 -61590 -53378 -38380 
co | 1-:00000 | 1-00000 |1-00000 | 1-00000 |1-00000 | 1-00000 |1-00000 | 1-00000 | 1-00000 
eee ae : ee —. 
































For v,= 0, r=0 























183 


TABLE OF LAGRANGIAN COEFFICIENTS FOR HARMONIC 
INTERPOLATION IN CERTAIN TABLES OF 
PERCENTAGE POINTS 


PREPARED BY L. J. COMRIE AND H. O. HARTLEY


This table was prepared to facilitate interpolation in the tables of percentage points of the 
Incomplete Beta-Function in the preceding paper. As, however, the main part of the table 
may be applied to interpolation in other tables with a similar lay-out, it is published 
separately. 

The table is based on harmonic interpolation, a device introduced by R. A. Fisher in
his tables of percentage points of the distribution of z, whose two parameters (degrees of
freedom) n₁ and n₂ range from 1 to ∞. It consists in using values of n₁ and n₂ in harmonic
progression, so that, with 1/n₁ and 1/n₂ as variables, z is tabulated at equidistant intervals
near 1/n₁ = 0 and also near 1/n₂ = 0. This transformation renders the z-table (apart from
its singularity at n₁ = ∞, n₂ = ∞) interpolable. As the percentage points of the Incomplete
Beta-Function show a similar behaviour,* the values of ν₁ and ν₂ near the margin of the
tables have been chosen in harmonic progression. Harmonic interpolation is, in fact,
applicable to any table of percentage points (depending on a parameter n with an infinite
range) in which the statistic can be adequately represented as a polynomial in 1/n.† It is


Table of Lagrangian Coefficients

Column headings are the arguments of tabular values and
row headings the arguments of the interpolate.

Ordinary

   | 7 | 8 | 9 | 10 | 12 | 15 | 20
   | − | + | − | + | + | − | +
11 | 0.069 231 | 0.428 571 | 1.090 909 | 1.440 000 | 0.300 000 | 0.008 571 | 0.000 140

   | 8 | 9 | 10 | 12 | 15 | 20 | 24
   | − | + | − | + | + | − | +
13 | 0.171 875 | 0.777 778 | 1.100 000 | 1.336 805 | 0.162 963 | 0.006 250 | 0.000 579
14 | 0.223 214 | 0.969 697 | 1.285 714 | 1.041 667 | 0.507 936 | 0.011 364 | 0.000 992

   | 9 | 10 | 12 | 15 | 20 | 24 | 30
   | − | + | − | + | + | − | +
16 | 0.172 391 | 0.448 000 | 0.604 938 | 1.238 914 | 0.106 909 | 0.017 284 | 0.000 790
17 | 0.306 397 | 0.780 000 | 0.983 025 | 1.258 272 | 0.289 546 | 0.040 124 | 0.001 728
18 | 0.332 468 | 0.833 143 | 1.000 000 | 1.024 000 | 0.530 182 | 0.057 143 | 0.002 286
19 | 0.222 222 | 0.550 000 | 0.636 574 | 0.570 370 | 0.787 500 | 0.050 926 | 0.001 852

Harmonic

   | 10 | 12 | 15 | 20 | 24 | 30 | 40
   | + | − | + | + | − | + | −
16 | 0.003 052 | 0.039 551 | 0.692 139 | 0.769 043 | 0.632 813 | 0.247 192 | 0.039 062
17 | 0.003 097 | 0.037 459 | 0.409 711 | 1.213 958 | 0.856 212 | 0.315 162 | 0.048 257
18 | 0.001 996 | 0.022 993 | 0.201 189 | 1.341 259 | 0.735 777 | 0.251 486 | 0.037 160
19 | 0.000 818 | 0.009 091 | 0.069 601 | 1.237 345 | 0.407 263 | 0.126 547 | 0.017 957
































* This issue, pp. 168-81. 
† It can be shown that any so-called “studentized” statistic has this property.





Table of Lagrangian Coefficients (continued) 














21 
22 
23 


25 
26 
27 
28 
29 


31 (15-5) 
32 (16-0) 
33 (16-5) 
34 (17-0) 


35 (17-5) 
36 (18-0) 
37 (18-5) 
38 (19-0) 
39 (19-5) 


41 (20-5) 
42 (21-0) 
43 (21-5) 
44 (22-0) 


45 (22-5) 
46 (23-0) 
47 (23-5) 
48 (24-0) 
49 (24-5) 


50 (25-0) 
51 (25-5) 
52 (26-0) 
58 (26-5) 
54 (27-0) 


61 (30-5) 
62 (31-0) 
63 (31-5) 
64 (32-C) 


65 (32-5) 
66 (33-0) 
67 (33-5) 
68 (34-0) 
69 (34-5) 





0-000 730 
0-000 993 
0-000 927 
0-000 670 
0-000 335 


20 
(10) 

a 
0-003 664 
0-005 875 
0-006 875 
0-006 948 


0-006 358 
0-005 335 
0-004 062 
0-002 684 
0-001 305 


0-001 182 
0-002 210 
0-003 069 
0-003 754 


0-004 268 
0-004 619 
0-004 819 
0-004 883 
0-004 824 


0-004 659 
0-004 402 
0-004 068 
0-003 669 
0-003 220 


0-002 730 
0-002 210 
0-001 669 
0-001 116 
0-000 558 


+ 
0-000 552 
0-001 093 
0-001 619 
0-002 128 


0-002 616 
0-003 082 
0-003 524 
0-003 940 
0-004 329 





15 


0-010 497 
0-009 653 
0-004 908 


20 


0-040 858 
0-050 984 
0-044 488 
0-030 498 
0-014 607 


24 
(12) 
0-041 459 
0-063 446 
0-071 503 
0-070 033 


0-062 423 
0-051 212 
0-038 250 
0-024 848 
0-011 904 


} 
0-010 511 
0-019 448 
0-026 748 
0-032 432 


0-036 580 
0-039 302 


| 0-040 733 


0-041 016 
0-040 293 


0-038 707 
0-036 392 
0-033 473 
0-030 065 
0-026 273 


0-022 191 
0-017 901 
0-013 477 
0-008 983 
0-004 475 


0-004 402 
0-008 695 
0-012 854 
0-016 853 


0-020 675 
0-024 304 
0-027 730 
0-030 945 
0-033 942 








Harmonic 
20 24 

+ + 
0-629 840 | 0-537 463 
0-337 838 0-864 864 
0-130 868 1-005 068 

24 30 

+ + 
0-817 152 0-306 432 
0-611 813 0-573 575 
0-415 220 0-778 537 
0-243 980 | 0-914 925 
0-105 172 0-985 984 

30 40 

(15) (20) 

+ + 
0-906, 909 0-179 142 
0-793 076 0-352 478 
0-670 341 0-510 736 
0-547 129 0-648 450 
0-429 158 0-762 947 
0-320 073 | 0-853 529 
0-221 984 | 0-920 823 
0-135 887 | 0-966 308 
0-061 998 | 0-991 960 

~ a 
0-050 764 0-992 724 
0-091 I61 0-972 384 
0-122 164 0-941 117 
0-144 787 0-900 900 
0-160 037 0-853 529 
0-168 878 0-800 606 
0-172 219 0-743 549 
0-170 899 0-683 594 
0-165 679 0-621 808 
0-157 248 0-559 104 
0-146 218 0-496 255 
0-133 132 0-433 910 
0-118 464 0-372 604 
0-102 630 0-312 777 
0-085 989 0-254 781 
0-068 849 0-198 897 
0-051 474 0-145 339 
0-034 088 0-094 268 
0-016 878 0-045 797 

+ om 
0-016 417 | 0-043 083 
0-032 268 | 0-083 441 
0-047 471 | 0-121 085 
0-061 959 | 0-156 045 
0-075 683 | 0-188 368 
0-088 608 | 0-218 113 
0-100 709 | 0-245 348 
0-111 971 | 0-270 154 
0-122 388 | 0-292 605 








30 


0-209 947 
0-253 378 
0-168 259 


40 


0-108 954 
0-174 804 
0-191 640 
0-162 653 
0-095 611 


60 
(30) 


0-062 545 
0-113 297 
0-148 964 
0-168 348 


0-171 663 
0-160 037 
0-135 121 
0-098 827 
0-053 141 


ae 
0-058 780 
0-121 548 
0-186 839 
0-253 378 


0-320 073 
0-386 006 


| 0-450 419 


0-512 695 
0-572 346 


0-628 992 


| 0-682 351 
| 0-732 223 


0-778 476 
0-821 038 


0-859 886 
0-897 035 
0-926 536 
0-954 463 
0-978 914 


+ 
1-017 846 
1-032 585 
1-044 357 


1-053 303 


| 1-059 570 


1-063 300 
1-064 636 
1-063 721 


1-060 692 





0-029 184 
0-044 986 
0-047 184 
0-038 122 
0-021 204 


120 
(60) 

4 
0-016 304 
0-028 839 
0-026 984 
0-040 717 


0-040 391 
0-036 580 
0-029 955 
0-021 212 
0-011 022 


0-011 310 
0-022 440 
0-033 000 
0-042 674 


0-051 212 
0-058 422 
0-064 169 
0-068 359 
0-070 939 


0-071 885 
0-071 202 
0-068 915 
0-065 067 
0-059 712 


0-052 916 
0-044 752 
0-035 297 
0-024 631 
0-012 838 


+ 
0-013 801 


| 0-028 485 


0-043 973 
0-060 189 


0-077 060 
0-094 516 
0-112 490 
0-130 919 
0-149 745 





60 


0-008 075 
0-008 890 
0-005 305 


120 


0-003 686 
0-005 579 
0-005 740 
0-004 546 
0-002 477 


i 8) 
(2) 
0-002 015 
0-003 525 
0-004 469 





0-004 863 


0-004 768 
0-004 268 
0-003 453 
0-002 416 
0-001 240 


> 
0-001 241 
0-002 431 
0-003 529 


0-004 505 | 


0-005 335 
0-006 005 
0-006 506 
0-006 836 
0-006 995 


0-006 989 
0-006 824 
0-006 509 
0-006 055 
0-005 474 


0-004 777 
0-003 978 
0-003 088 
0-002 121 
0-001 088 


0-001 131 
0-002 295 
0-003 481 
0-004 681 


0-005 886 
0-007 089 
0-008 281 
0-009 455 
0-010 607 











Table of Lagrangian Coefficients (continued)

Harmonic

           | 20 | 24 | 30 | 40 | 60 | 120 | ∞
           | (10) | (12) | (15) | (20) | (30) | (60) | (∞)
           | + | − | + | − | + | + | −
70 (35.0)  | 0.004 692 | 0.036 719 | 0.131 960 | 0.312 795 | 1.055 683 | 0.168 909 | 0.011 730
71 (35.5)  | 0.005 027 | 0.039 275 | 0.140 696 | 0.330 812 | 1.048 823 | 0.188 360 | 0.012 819
72 (36.0)  | 0.005 335 | 0.041 610 | 0.148 605 | 0.346 746 | 1.040 238 | 0.208 048 | 0.013 870
73 (36.5)  | 0.005 615 | 0.043 725 | 0.155 705 | 0.360 690 | 1.030 048 | 0.227 925 | 0.014 878
74 (37.0)  | 0.005 867 | 0.045 623 | 0.162 013 | 0.372 737 | 1.018 370 | 0.247 952 | 0.015 841

75 (37.5)  | 0.006 093 | 0.047 309 | 0.167 552 | 0.382 976 | 1.005 312 | 0.268 083 | 0.016 755
76 (38.0)  | 0.006 292 | 0.048 787 | 0.172 345 | 0.391 499 | 0.990 981 | 0.288 285 | 0.017 617
77 (38.5)  | 0.006 465 | 0.050 062 | 0.176 416 | 0.398 393 | 0.975 477 | 0.308 523 | 0.018 426
78 (39.0)  | 0.006 613 | 0.051 141 | 0.179 793 | 0.403 745 | 0.958 894 | 0.328 764 | 0.019 178
79 (39.5)  | 0.006 736 | 0.052 030 | 0.182 502 | 0.407 639 | 0.941 324 | 0.348 979 | 0.019 872

80 (40.0)  | 0.006 836 | 0.052 734 | 0.184 570 | 0.410 156 | 0.922 851 | 0.369 141 | 0.020 508
81 (40.5)  | 0.006 912 | 0.053 262 | 0.186 027 | 0.411 376 | 0.903 557 | 0.389 225 | 0.021 083
82 (41.0)  | 0.006 967 | 0.053 621 | 0.186 898 | 0.411 373 | 0.883 518 | 0.409 208 | 0.021 597
83 (41.5)  | 0.007 000 | 0.053 816 | 0.187 212 | 0.410 223 | 0.862 805 | 0.429 071 | 0.022 049
84 (42.0)  | 0.007 012 | 0.053 855 | 0.186 997 | 0.407 993 | 0.841 486 | 0.448 793 | 0.022 440

85 (42.5)  | 0.007 005 | 0.053 746 | 0.186 279 | 0.404 753 | 0.819 625 | 0.468 357 | 0.022 767
86 (43.0)  | 0.006 980 | 0.053 495 | 0.185 083 | 0.400 567 | 0.797 283 | 0.487 749 | 0.023 033
87 (43.5)  | 0.006 936 | 0.053 110 | 0.183 437 | 0.395 496 | 0.774 514 | 0.506 954 | 0.023 235
88 (44.0)  | 0.006 875 | 0.052 596 | 0.181 366 | 0.389 600 | 0.751 371 | 0.525 960 | 0.023 376
89 (44.5)  | 0.006 798 | 0.051 961 | 0.178 892 | 0.382 934 | 0.727 905 | 0.544 755 | 0.023 455

90 (45.0)  | 0.006 706 | 0.051 212 | 0.176 040 | 0.375 552 | 0.704 161 | 0.563 329 | 0.023 472
91 (45.5)  | 0.006 600 | 0.050 354 | 0.172 833 | 0.367 506 | 0.680 182 | 0.581 673 | 0.023 428
92 (46.0)  | 0.006 479 | 0.049 393 | 0.169 293 | 0.358 843 | 0.656 009 | 0.599 780 | 0.023 325
93 (46.5)  | 0.006 346 | 0.048 337 | 0.165 440 | 0.349 609 | 0.631 680 | 0.617 642 | 0.023 162
94 (47.0)  | 0.006 200 | 0.047 190 | 0.161 295 | 0.339 848 | 0.607 229 | 0.635 254 | 0.022 940

95 (47.5)  | 0.006 043 | 0.045 959 | 0.156 878 | 0.329 602 | 0.582 689 | 0.652 611 | 0.022 660
96 (48.0)  | 0.005 875 | 0.044 647 | 0.152 207 | 0.318 909 | 0.558 090 | 0.669 708 | 0.022 324
97 (48.5)  | 0.005 696 | 0.043 262 | 0.147 299 | 0.307 806 | 0.533 462 | 0.686 542 | 0.021 931
98 (49.0)  | 0.005 509 | 0.041 806 | 0.142 173 | 0.296 330 | 0.508 829 | 0.703 109 | 0.021 484
99 (49.5)  | 0.005 312 | 0.040 287 | 0.136 844 | 0.284 511 | 0.484 217 | 0.719 408 | 0.020 983

100 (50.0) | 0.005 107 | 0.038 707 | 0.131 328 | 0.272 384 | 0.459 648 | 0.735 437 | 0.020 429
101 (50.5) | 0.004 894 | 0.037 072 | 0.125 649 | 0.259 976 | 0.435 143 | 0.751 194 | 0.019 823
102 (51.0) | 0.004 675 | 0.035 385 | 0.119 793 | 0.247 316 | 0.410 721 | 0.766 679 | 0.019 167
103 (51.5) | 0.004 448 | 0.033 651 | 0.113 802 | 0.234 429 | 0.386 400 | 0.781 891 | 0.018 461
104 (52.0) | 0.004 216 | 0.031 873 | 0.107 680 | 0.221 342 | 0.362 196 | 0.796 831 | 0.017 708

105 (52.5) | 0.003 978 | 0.030 056 | 0.101 437 | 0.208 077 | 0.338 125 | 0.811 499 | 0.016 906
106 (53.0) | 0.003 735 | 0.028 201 | 0.095 087 | 0.194 656 | 0.314 199 | 0.825 895 | 0.016 059
107 (53.5) | 0.003 486 | 0.026 314 | 0.088 639 | 0.181 099 | 0.290 433 | 0.840 022 | 0.015 167
108 (54.0) | 0.003 234 | 0.024 397 | 0.082 104 | 0.167 427 | 0.266 837 | 0.853 880 | 0.014 231
109 (54.5) | 0.002 978 | 0.022 452 | 0.075 492 | 0.153 658 | 0.243 423 | 0.867 470 | 0.013 253

110 (55.0) | 0.002 719 | 0.020 484 | 0.068 812 | 0.139 809 | 0.220 199 | 0.880 796 | 0.012 233
111 (55.5) | 0.002 456 | 0.018 494 | 0.062 074 | 0.125 896 | 0.197 175 | 0.893 858 | 0.011 173
112 (56.0) | 0.002 190 | 0.016 485 | 0.055 284 | 0.111 933 | 0.174 358 | 0.906 660 | 0.010 074
113 (56.5) | 0.001 922 | 0.014 459 | 0.048 452 | 0.097 936 | 0.151 755 | 0.919 203 | 0.008 937
114 (57.0) | 0.001 651 | 0.012 420 | 0.041 584 | 0.083 918 | 0.129 374 | 0.931 491 | 0.007 762

115 (57.5) | 0.001 379 | 0.010 368 | 0.034 688 | 0.069 891 | 0.107 219 | 0.943 525 | 0.006 552
116 (58.0) | 0.001 106 | 0.008 307 | 0.027 770 | 0.055 866 | 0.085 295 | 0.955 309 | 0.005 307
117 (58.5) | 0.000 831 | 0.006 238 | 0.020 837 | 0.041 854 | 0.063 608 | 0.966 845 | 0.004 029
118 (59.0) | 0.000 554 | 0.004 162 | 0.013 894 | 0.027 867 | 0.042 161 | 0.978 137 | 0.002 717
119 (59.5) | 0.000 278 | 0.002 083 | 0.006 947 | 0.013 913 | 0.020 957 | 0.989 188 | 0.001 374













186 Lagrangian coefficients for harmonic interpolation 


particularly convenient if the high-order terms are so small that linear interpolation 
suffices. Unfortunately, however, if the interpolate is required to the same accuracy as the 
tabular values, linear interpolation in the above-mentioned tables is inadequate; hence this 
table has been prepared as the simplest method of preserving tabular accuracy in the 
interpolates. The interpolate is the sum of seven products of which the seven (Lagrangian) 
multipliers are taken from this table whilst the multiplicands are tabular entries in 
the table of percentage points. The examples below illustrate the use of the table. 
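The inadequacy of linear interpolation can be illustrated numerically. The sketch below, in Python, takes the 0.5 % points for ν₁ = 4 at ν₂ = 60 and ν₂ = 120 from the table of 0.5 % points and interpolates linearly for ν₂ = 96, first in ν₂ itself and then in 1/ν₂. The helper name is illustrative only. Example 1 gives 0.857 94 by the seven-term formula and states that this is two units in the fifth decimal above the exact value.

```python
def linear_interp(x0, y0, x1, y1, x):
    """Straight-line interpolation between (x0, y0) and (x1, y1)."""
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)

# 0.5 % points for nu1 = 4 at nu2 = 60 and nu2 = 120
y60, y120 = 0.78370, 0.88442

ordinary = linear_interp(60, y60, 120, y120, 96)              # linear in nu2
harmonic = linear_interp(1 / 60, y60, 1 / 120, y120, 1 / 96)  # linear in 1/nu2

print(round(ordinary, 5), round(harmonic, 5))
```

Linear interpolation in 1/ν₂ is much closer than linear interpolation in ν₂, but it still differs from the seven-term result in the third decimal place.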

The calculation of the Lagrangian coefficients follows the standard formulae for
interpolation by a polynomial of the sixth degree. Where interpolation is harmonic the
coefficients are those for polynomials of the sixth degree in the reciprocal of the parameters
used as argument.
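The calculation just described can be sketched in a few lines of Python. The function below is an illustrative implementation of the standard Lagrange formula, not the authors' own computing scheme. For the harmonic case the nodes are the reciprocals of the column arguments, with ∞ mapped to 0. It is applied here to row 96 with columns 20, 24, 30, 40, 60, 120, ∞.

```python
from fractions import Fraction

def lagrange_coefficients(nodes, x):
    """Return the Lagrangian coefficient L_k(x) for every node (exact arithmetic)."""
    coeffs = []
    for k, xk in enumerate(nodes):
        c = Fraction(1)
        for j, xj in enumerate(nodes):
            if j != k:
                c *= Fraction(x - xj) / Fraction(xk - xj)
        coeffs.append(c)
    return coeffs

# Harmonic case: the argument is 1/n, and n = infinity corresponds to 1/n = 0.
args = [20, 24, 30, 40, 60, 120]
nodes = [Fraction(1, n) for n in args] + [Fraction(0)]
row96 = lagrange_coefficients(nodes, Fraction(1, 96))

for label, c in zip(args + ["inf"], row96):
    print(label, f"{float(c):+.6f}")
```

The seven coefficients sum to unity, and their absolute values can be compared with row 96 of the table; the signs correspond to those printed at the head of the columns.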

In the early part of the table, however, ordinary Lagrangian coefficients are given, 
since the polynomial in the parameter itself is preferable in this range. The part of the table 
with row headings less than 30 has been specifically designed to meet the requirements of 
the tables of percentage points of the Incomplete Beta Function. In particular it will be 
noted that there are two rows for 16, 17, 18 and 19, one giving harmonic and the other 
ordinary Lagrangian coefficients; the application of both rows affords a good check, as 
will be seen from Example 2 on p. 162 in the preceding paper. 
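The check afforded by the two rows can be sketched as follows in Python, using the 0.5 % points for ν₁ = 4 and the two rows for 16 in the coefficient table. The signs attached to the coefficients are an assumption of this sketch: they are chosen as the alternating pattern for which each row sums to unity. The variable names are illustrative.

```python
# 0.5 % points for nu1 = 4 at the tabulated values of nu2
x = {9: 0.22046, 10: 0.25399, 12: 0.31509, 15: 0.39255,
     20: 0.49144, 24: 0.55098, 30: 0.61864, 40: 0.69571}

# Row 16 of the coefficient table, ordinary and harmonic, with signs attached
ordinary = {9: -0.172391, 10: 0.448000, 12: -0.604938, 15: 1.238914,
            20: 0.106909, 24: -0.017284, 30: 0.000790}
harmonic = {10: 0.003052, 12: -0.039551, 15: 0.692139, 20: 0.769043,
            24: -0.632813, 30: 0.247192, 40: -0.039062}

def interpolate(coeffs, table):
    """Sum of products of Lagrangian multipliers and tabular entries."""
    return sum(c * table[arg] for arg, c in coeffs.items())

x_ordinary = interpolate(ordinary, x)
x_harmonic = interpolate(harmonic, x)
print(round(x_ordinary, 5), round(x_harmonic, 5))
```

Both interpolates agree to about two units in the fifth decimal, and both lie close to the tabulated entry 0.41503 for ν₂ = 16.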

The table may be used not only for the progression 10, 12, 15, 20, 24, 30, 40, 60, 120, ∞,
but also for submultiple progressions. The most important of these, namely 10, 12, 15, 20,
30, 60, ∞, is obtained by halving the last seven terms, and is catered for by the auxiliary
arguments in brackets. Division by 4 yields the progression 5, 6, 7.5, 10, 15, 30, ∞, while
division by 5 yields 3, 4, 4.8, 6, 8, 12, 24, ∞, from which we can select the first seven or the
last seven values.
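That the same coefficients serve for submultiple progressions follows because harmonic Lagrangian coefficients are unchanged when all arguments are divided by a common factor. A minimal Python sketch of this property is given below; the function name and the use of `None` for ∞ are conventions of the sketch only.

```python
from fractions import Fraction

def harmonic_coefficients(args, n):
    """Lagrangian coefficients in u = 1/n; None stands for n = infinity (u = 0)."""
    u = [Fraction(0) if a is None else 1 / Fraction(a) for a in args]
    target = 1 / Fraction(n)
    out = []
    for k in range(len(u)):
        c = Fraction(1)
        for j in range(len(u)):
            if j != k:
                c *= (target - u[j]) / (u[k] - u[j])
        out.append(c)
    return out

full = harmonic_coefficients([20, 24, 30, 40, 60, 120, None], 108)
half = harmonic_coefficients([10, 12, 15, 20, 30, 60, None], 54)
quarter = harmonic_coefficients([5, 6, Fraction(15, 2), 10, 15, 30, None], 27)

print(full == half == quarter)
print([f"{float(c):+.6f}" for c in half])
```

Row 108 of the table, with bracketed argument (54.0), can thus be used for the argument 54 in the halved progression, and equally for 27 in the progression obtained on division by 4.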

The missing values 7.5 or 4.8 can be found by ordinary interpolation from values in
their immediate neighbourhood. If linear interpolation does not suffice we may use

$$f(7.5) = \tfrac{1}{16}\{-f(6) + 9f(7) + 9f(8) - f(9)\},$$

which takes third differences into account, and

$$f(4.8) = 0.12\,f(4) + 0.96\,f(5) - 0.08\,f(6),$$

which takes second differences into account. Since f(7.5) and f(4.8) are not required to full
tabular accuracy these formulae will, as a rule, suffice.
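The weights in these two formulae are ordinary Lagrangian coefficients, for four points at 6, 7, 8, 9 and for three points at 4, 5, 6 respectively. A short Python sketch that recovers them exactly (the function name is illustrative):

```python
from fractions import Fraction

def weights(nodes, x):
    """Ordinary Lagrangian weights giving f(x) from values of f at the nodes."""
    x = Fraction(x)
    w = []
    for k, xk in enumerate(nodes):
        c = Fraction(1)
        for j, xj in enumerate(nodes):
            if j != k:
                c *= (x - xj) / Fraction(xk - xj)
        w.append(c)
    return w

w75 = weights([6, 7, 8, 9], Fraction(15, 2))   # weights for f(7.5)
w48 = weights([4, 5, 6], Fraction(24, 5))      # weights for f(4.8)
print(w75)
print(w48)
```

The first set is −1/16, 9/16, 9/16, −1/16 and the second is 3/25, 24/25, −2/25, that is 0.12, 0.96 and −0.08.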


Example 1. Find the 0.5 % point of the Incomplete Beta Function corresponding to
ν₁ = 4 and ν₂ = 96.

In the accompanying table enter row 96, which gives the Lagrangian multipliers. The
corresponding multiplicands are taken from the column of the table of 0.5 % points on
p. 180 and are the entries for ν₂ = 20, 24, 30, 40, 60, 120 and ∞, which correspond to the
column headings in the Lagrangian table. The sign of each product is also given at the top
of the columns. We have, therefore, the scheme shown below. The result is two units
greater in the fifth decimal than the exact value obtained by inverse interpolation in
Pearson’s tables.

x(96, 4) = + 0.491 44 × 0.005 875
           − 0.550 98 × 0.044 647
           + 0.618 64 × 0.152 207
           − 0.695 71 × 0.318 909
           + 0.783 70 × 0.558 090
           + 0.884 42 × 0.669 708
           − 1.000 00 × 0.022 324
         = 0.857 94
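The scheme of Example 1 is a signed sum of seven products and may be sketched in Python as below. The multipliers are those of row 96 of the coefficient table and the multiplicands are the 0.5 % points for ν₁ = 4 at ν₂ = 20, 24, 30, 40, 60, 120, ∞. The sign pattern + − + − + + − is assumed to be that printed at the head of the columns, and the variable names are illustrative.

```python
signs       = [+1, -1, +1, -1, +1, +1, -1]
multipliers = [0.005875, 0.044647, 0.152207, 0.318909,
               0.558090, 0.669708, 0.022324]            # row 96
points      = [0.49144, 0.55098, 0.61864, 0.69571,
               0.78370, 0.88442, 1.00000]               # nu1 = 4 column

# The interpolate is the sum of the seven signed products
x_96_4 = sum(s * m * p for s, m, p in zip(signs, multipliers, points))
print(f"{x_96_4:.6f}")
```

To five decimals the sum agrees with the value 0.857 94 quoted in the example.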


Example 2. In Fisher’s table of 1 % points of the distribution of z find the point
corresponding to n₁ = 12 and n₂ = 54.

In Fisher’s table we have the harmonic progression 10, 12, 15, 20, 30, 60, ∞, for which
our table provides by means of the arguments in brackets. Using the Lagrangian
multipliers, and the tabular entries corresponding to the bracketed column headings in
row (54), we have the values below.

z(12, 54) = + 0.7744 × 0.003 23
            − 0.7122 × 0.024 40
            + 0.6496 × 0.082 10
            − 0.5864 × 0.167 43
            + 0.5224 × 0.266 84
            + 0.4574 × 0.853 88
            − 0.3908 × 0.014 23
          = 0.4647





TABLE OF PERCENTAGE POINTS OF THE
χ² DISTRIBUTION


CALCULATED BY CATHERINE M. THOMPSON


EDITORIAL 


In the calculation of the percentage points of the incomplete B-function
I_x(p, q) for large values of p and q described in the preceding paper, use has been
made of the relation between this function and the incomplete Γ-function
I(u, p*)† which was tabulated by Karl Pearson (2). Since the incomplete Γ-function
is related to the probability integral of χ², it was decided that the calculation
of the percentage points of u already carried out should be extended to form
complete tables of percentage points of χ². Before describing the method of
computation used in deriving these tables it is desirable to relate them to existing
tables and to define the relation between the functions.

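In common terminology the probability distribution of χ² having ν degrees
of freedom may be written

$$f(\chi^2) = \frac{1}{2^{\frac{1}{2}\nu}\,\Gamma(\tfrac{1}{2}\nu)}\,(\chi^2)^{\frac{1}{2}\nu - 1}\,e^{-\frac{1}{2}\chi^2}. \qquad (1)$$

The probability integral of χ², or the chance that this quantity exceeds a
given value χ², is then

$$P = P(\chi^2) = \int_{\chi^2}^{\infty} f(\chi^2)\,d\chi^2. \qquad (2)$$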


Conversely, for given degrees of freedom ν, the integral (2) will be equal to
a given probability level P for one particular lower limit χ². This lower limit is
called the percentage point χ² corresponding to ν and P; it will be denoted by χ²(P).
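The definition of a percentage point can be illustrated by a short numerical sketch in Python. It is not the method of computation used for the tables: it evaluates the integral (2) from the power series for the incomplete Γ-function and finds the lower limit by bisection. The function names and the search bound of 400 are assumptions of the sketch.

```python
import math

def chi2_upper_tail(x, nu):
    """Chance that chi-square with nu degrees of freedom exceeds x (series method)."""
    if x <= 0:
        return 1.0
    a, t = nu / 2.0, x / 2.0
    term = total = 1.0
    n = 0
    while term > 1e-16 * total:
        n += 1
        term *= t / (a + n)      # next term of sum t^n / ((a+1)...(a+n))
        total += term
    lower = math.exp(a * math.log(t) - t - math.lgamma(a + 1.0)) * total
    return 1.0 - lower

def chi2_point(P, nu, hi=400.0):
    """Bisection for the lower limit at which the upper-tail probability equals P."""
    lo = 0.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if chi2_upper_tail(mid, nu) > P:
            lo = mid             # tail still too large, so the point lies to the right
        else:
            hi = mid
    return 0.5 * (lo + hi)

print(round(chi2_point(0.95, 10), 5))
print(round(chi2_point(0.995, 2), 7))
```

For ν = 2 the integral reduces to P = exp(−½χ²), so that χ²(P) = −2 log P, which gives an independent check on the second value.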

These percentage points χ²(P) were first tabulated by R. A. Fisher() for
P = 0.99, 0.98, 0.95, 0.90, 0.80, 0.70, 0.50, 0.30, 0.20, 0.10, 0.05, 0.02, 0.01‡ and
for ν = 1(1)30. Most entries in the body of Fisher’s table are given to three
decimal accuracy (i.e. four to five significant figures) but more decimals are given
for small percentage points which are given to three-figure accuracy. In the table
which follows χ²(P) is tabulated for P = 0.995, 0.99, 0.975, 0.95, 0.90, 0.75, 0.50,
0.25, 0.10, 0.05, 0.025, 0.01, 0.005 (which are the levels used for I_x(p, q)) whilst
the range of the degrees of freedom has been extended up to ν = 100. The
percentage points are given to six significant figures, although for ν > 50 the sixth
figure may be in error by one or two units.

† Karl Pearson’s notation was I(u, p). To avoid confusion with the parameter p in I_x(p, q) we
have added the asterisk.
‡ In a later edition the level P = 0.001 was added.




TABLE OF PERCENTAGE POINTS OF THE χ² DISTRIBUTION

ν \ P | 0.995 | 0.990 | 0.975 | 0.950 | 0.900 | 0.750
1 | 392704×10⁻¹⁰ | 157088×10⁻⁹ | 982069×10⁻⁹ | 393214×10⁻⁸ | 0.0157908 | 0.1015308
2 | 0.0100251 | 0.0201007 | 0.0506356 | 0.102587 | 0.210720 | 0.575364
3 | 0.0717212 | 0.114832 | 0.215795 | 0.351846 | 0.584375 | 1.212534
4 | 0.206990 | 0.297110 | 0.484419 | 0.710721 | 1.063623 | 1.92255
5 | 0.411740 | 0.554300 | 0.831211 | 1.145476 | 1.61031 | 2.67460
6 | 0.675727 | 0.872085 | 1.237347 | 1.63539 | 2.20413 | 3.45460
7 | 0.989265 | 1.239043 | 1.68987 | 2.16735 | 2.83311 | 4.25485
8 | 1.344419 | 1.646482 | 2.17973 | 2.73264 | 3.48954 | 5.07064
9 | 1.734926 | 2.087912 | 2.70039 | 3.32511 | 4.16816 | 5.89883
10 | 2.15585 | 2.55821 | 3.24697 | 3.94030 | 4.86518 | 6.73720
11 | 2.60321 | 3.05347 | 3.81575 | 4.57481 | 5.57779 | 7.58412
12 | 3.07382 | 3.57056 | 4.40379 | 5.22603 | 6.30380 | 8.43842
13 | 3.56503 | 4.10691 | 5.00874 | 5.89186 | 7.04150 | 9.29906
14 | 4.07468 | 4.66043 | 5.62872 | 6.57063 | 7.78953 | 10.1653
15 | 4.60094 | 5.22935 | 6.26214 | 7.26094 | 8.54675 | 11.0365
16 | 5.14224 | 5.81221 | 6.90766 | 7.96164 | 9.31223 | 11.9122
17 | 5.69724 | 6.40776 | 7.56418 | 8.67176 | 10.0852 | 12.7919
18 | 6.26481 | 7.01491 | 8.23075 | 9.39046 | 10.8649 | 13.6753
19 | 6.84398 | 7.63273 | 8.90655 | 10.1170 | 11.6509 | 14.5620
20 | 7.43386 | 8.26040 | 9.59083 | 10.8508 | 12.4426 | 15.4518
21 | 8.03366 | 8.89720 | 10.28293 | 11.5913 | 13.2396 | 16.3444
22 | 8.64272 | 9.54249 | 10.9823 | 12.3380 | 14.0415 | 17.2396
23 | 9.26042 | 10.19567 | 11.6885 | 13.0905 | 14.8479 | 18.1373
24 | 9.88623 | 10.8564 | 12.4011 | 13.8484 | 15.6587 | 19.0372
25 | 10.5197 | 11.5240 | 13.1197 | 14.6114 | 16.4734 | 19.9393
26 | 11.1603 | 12.1981 | 13.8439 | 15.3791 | 17.2919 | 20.8434
27 | 11.8076 | 12.8786 | 14.5733 | 16.1513 | 18.1138 | 21.7494
28 | 12.4613 | 13.5648 | 15.3079 | 16.9279 | 18.9392 | 22.6572
29 | 13.1211 | 14.2565 | 16.0471 | 17.7083 | 19.7677 | 23.5666
30 | 13.7867 | 14.9535 | 16.7908 | 18.4926 | 20.5992 | 24.4776
40 | 20.7065 | 22.1643 | 24.4331 | 26.5093 | 29.0505 | 33.6603
50 | 27.9907 | 29.7067 | 32.3574 | 34.7642 | 37.6886 | 42.9421
60 | 35.5346 | 37.4848 | 40.4817 | 43.1879 | 46.4589 | 52.2938
70 | 43.2752 | 45.4418 | 48.7576 | 51.7393 | 55.3290 | 61.6983
80 | 51.1720 | 53.5400 | 57.1532 | 60.3915 | 64.2778 | 71.1445
90 | 59.1963 | 61.7541 | 65.6466 | 69.1260 | 73.2912 | 80.6247
100 | 67.3276 | 70.0648 | 74.2219 | 77.9295 | 82.3581 | 90.1332
y_P | −2.5758 | −2.3263 | −1.9600 | −1.6449 | −1.2816 | −0.6745

For 30 < ν < 100 interpolation formulae (6) or (7) of the Introduction may be used.

















TABLE OF PERCENTAGE POINTS OF THE χ² DISTRIBUTION (continued)

ν \ P | 0.500 | 0.250 | 0.100 | 0.050 | 0.025 | 0.010 | 0.005
1 | 0.454937 | 1.32330 | 2.70554 | 3.84146 | 5.02389 | 6.63490 | 7.87944
2 | 1.38629 | 2.77259 | 4.60517 | 5.99147 | 7.37776 | 9.21034 | 10.5966
3 | 2.36597 | 4.10835 | 6.25139 | 7.81473 | 9.34840 | 11.3449 | 12.8381
4 | 3.35670 | 5.38527 | 7.77944 | 9.48773 | 11.1433 | 13.2767 | 14.8602
5 | 4.35146 | 6.62568 | 9.23635 | 11.0705 | 12.8325 | 15.0863 | 16.7496
6 | 5.34812 | 7.84080 | 10.6446 | 12.5916 | 14.4494 | 16.8119 | 18.5476
7 | 6.34581 | 9.03715 | 12.0170 | 14.0671 | 16.0128 | 18.4753 | 20.2777
8 | 7.34412 | 10.2188 | 13.3616 | 15.5073 | 17.5346 | 20.0902 | 21.9550
9 | 8.34283 | 11.3887 | 14.6837 | 16.9190 | 19.0228 | 21.6660 | 23.5893
10 | 9.34182 | 12.5489 | 15.9871 | 18.3070 | 20.4831 | 23.2093 | 25.1882
11 | 10.3410 | 13.7007 | 17.2750 | 19.6751 | 21.9200 | 24.7250 | 26.7569
12 | 11.3403 | 14.8454 | 18.5494 | 21.0261 | 23.3367 | 26.2170 | 28.2995
13 | 12.3398 | 15.9839 | 19.8119 | 22.3621 | 24.7356 | 27.6883 | 29.8194
14 | 13.3393 | 17.1170 | 21.0642 | 23.6848 | 26.1190 | 29.1413 | 31.3193
15 | 14.3389 | 18.2451 | 22.3072 | 24.9958 | 27.4884 | 30.5779 | 32.8013
16 | 15.3385 | 19.3688 | 23.5418 | 26.2962 | 28.8454 | 31.9999 | 34.2672
17 | 16.3381 | 20.4887 | 24.7690 | 27.5871 | 30.1910 | 33.4087 | 35.7185
18 | 17.3379 | 21.6049 | 25.9894 | 28.8693 | 31.5264 | 34.8053 | 37.1564
19 | 18.3376 | 22.7178 | 27.2036 | 30.1435 | 32.8523 | 36.1908 | 38.5822
20 | 19.3374 | 23.8277 | 28.4120 | 31.4104 | 34.1696 | 37.5662 | 39.9968
21 | 20.3372 | 24.9348 | 29.6151 | 32.6705 | 35.4789 | 38.9321 | 41.4010
22 | 21.3370 | 26.0393 | 30.8133 | 33.9244 | 36.7807 | 40.2894 | 42.7956
23 | 22.3369 | 27.1413 | 32.0069 | 35.1725 | 38.0757 | 41.6384 | 44.1813
24 | 23.3367 | 28.2412 | 33.1963 | 36.4151 | 39.3641 | 42.9798 | 45.5585
25 | 24.3366 | 29.3389 | 34.3816 | 37.6525 | 40.6465 | 44.3141 | 46.9278
26 | 25.3364 | 30.4345 | 35.5631 | 38.8852 | 41.9232 | 45.6417 | 48.2899
27 | 26.3363 | 31.5284 | 36.7412 | 40.1133 | 43.1944 | 46.9630 | 49.6449
28 | 27.3363 | 32.6205 | 37.9159 | 41.3372 | 44.4607 | 48.2782 | 50.9933
29 | 28.3362 | 33.7109 | 39.0875 | 42.5569 | 45.7222 | 49.5879 | 52.3356
30 | 29.3360 | 34.7998 | 40.2560 | 43.7729 | 46.9792 | 50.8922 | 53.6720
40 | 39.3354 | 45.6160 | 51.8050 | 55.7585 | 59.3417 | 63.6907 | 66.7659
50 | 49.3349 | 56.3336 | 63.1671 | 67.5048 | 71.4202 | 76.1539 | 79.4900
60 | 59.3347 | 66.9814 | 74.3970 | 79.0819 | 83.2976 | 88.3794 | 91.9517
70 | 69.3344 | 77.5766 | 85.5271 | 90.5312 | 95.0231 | 100.425 | 104.215
80 | 79.3343 | 88.1303 | 96.5782 | 101.879 | 106.629 | 112.329 | 116.321
90 | 89.3342 | 98.6499 | 107.565 | 113.145 | 118.136 | 124.116 | 128.299
100 | 99.3341 | 109.141 | 118.498 | 124.342 | 129.561 | 135.807 | 140.169
y_P | 0.0000 | +0.6745 | +1.2816 | +1.6449 | +1.9600 | +2.3263 | +2.5758






































For ν > 100 take $\chi^2_\nu(P) = \nu\left\{1 - \frac{2}{9\nu} + y_P\sqrt{\frac{2}{9\nu}}\right\}^3$ or $\chi^2_\nu(P) = \tfrac12\{y_P + \sqrt{2\nu - 1}\}^2$,
according to the degree of accuracy required.











The relation with the incomplete Γ-function is given by

$$ P(\chi^2) = 1 - I(u, p^*) \qquad (3) $$

where

$$ \chi^2 = 2u\sqrt{p^* + 1}, \qquad (4) $$

the degrees of freedom, ν, of χ² being given by

$$ \nu = 2p^* + 2. \qquad (5) $$

The above relations were used for the computation of χ²_ν(P). Most of the
entries were obtained from the Tables of the Incomplete Γ-Function (2). In these
tables the column headed p* = ½ν − 1 was entered, the root u of

$$ I(u, p^*) = 1 - P $$

found by inverse interpolation and transformed into the corresponding
percentage point for χ² by substitution in equation (4).
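The route through relations (3) to (5) can be sketched in Python. This is only an illustration under stated assumptions: Pearson's I(u, p*) is taken to be the incomplete Γ-function ratio with upper limit u√(p* + 1), it is evaluated here by a power series rather than read from the printed tables, and bisection stands in for inverse interpolation. The function names are hypothetical.

```python
import math

def pearson_I(u, p_star, max_terms=2000):
    """Assumed form of I(u, p*): incomplete Gamma ratio with limit u*sqrt(p*+1)."""
    a = p_star + 1.0
    x = u * math.sqrt(a)
    if x <= 0.0:
        return 0.0
    # Series x^a e^{-x} * sum_{n>=0} x^n / Gamma(a + n + 1), built term by term.
    term = math.exp(a * math.log(x) - x - math.lgamma(a + 1.0))
    total = 0.0
    for n in range(max_terms):
        total += term
        term *= x / (a + n + 1.0)
        if term < 1e-17 * total:
            break
    return total

def chi2_point_via_I(P, nu, tol=1e-12):
    """Solve I(u, p*) = 1 - P for u, then apply equation (4)."""
    p_star = 0.5 * nu - 1.0          # equation (5) rearranged
    target = 1.0 - P                 # equation (3)
    lo, hi = 0.0, 1.0
    while pearson_I(hi, p_star) < target:
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if pearson_I(mid, p_star) < target:
            lo = mid
        else:
            hi = mid
    u = 0.5 * (lo + hi)
    return 2.0 * u * math.sqrt(p_star + 1.0)   # equation (4)

print(chi2_point_via_I(0.01, 50))
```

The printed value can be checked against the tabulated entry for ν = 50, P = 0.010.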
Although for small values of p* and u the table of I(u, p*) is not interpolable,
formal inverse interpolation in this range of the table still yields approximate
values of the percentage points. To make these accurate, auxiliary tables of
P(χ²) were constructed for arguments χ² in the neighbourhood of the approximate
percentage points, and the exact values of χ²_ν(P) found by inverse interpolation
in these auxiliary tables. The latter were constructed from the expansion of
P(χ²) given on p. xxxi of Tables for Statisticians and Biometricians, Part I (3);
since the auxiliary tables were required only for small values of χ², a few terms in
the expansions were sufficient to yield P(χ²) to the required accuracy.


Whilst the existing table of percentage points of χ² (1) is confined to the range
ν = 1(1)30, the range ν = 30(10)100 has been added in the table below. This has
been done because the customary approximation to χ²_ν(P) by the corresponding
normal deviates is not very satisfactory in this range of ν. There are, indeed,
percentage points which differ from the approximate ones in the second significant
figure. In our table, however, linear interpolation (which is particularly
convenient at interval 10) yields interpolates accurate to about four significant
figures. If we write ν = 10k + m with 3 ≤ k < 10 and 0 ≤ m < 10 then we have

$$ \chi^2_\nu = \tfrac{1}{10}\{(10 - m)\chi^2_{10k} + m\chi^2_{10k+10}\}. \qquad (6) $$

For instance for ν = 54 and P = 0.01 we have

$$ \chi^2_{54}(0.01) = \tfrac{1}{10}\{6\chi^2_{50}(0.01) + 4\chi^2_{60}(0.01)\} = 81.04. $$
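The linear interpolation of formula (6) can be written as a few lines of Python. The sketch below is illustrative only; the dictionary holds the two tabulated percentage points for P = 0.01 needed for the worked case, and the function name is hypothetical.

```python
def interpolate_linear(nu, table):
    """Formula (6): linear interpolation between the entries at 10k and 10k+10."""
    k, m = divmod(nu, 10)
    if m == 0:
        return table[nu]             # tabulated value, nothing to interpolate
    lower, upper = table[10 * k], table[10 * k + 10]
    return ((10 - m) * lower + m * upper) / 10.0

# Tabulated percentage points for P = 0.01 at nu = 50 and nu = 60.
points_P01 = {50: 76.1539, 60: 88.3794}

print(round(interpolate_linear(54, points_P01), 2))
```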
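If higher accuracy is required we have to use the four point Lagrangian formula,
viz.

$$ \chi^2_\nu = L_{-1}\chi^2_{10k-10} + L_0\chi^2_{10k} + L_1\chi^2_{10k+10} + L_2\chi^2_{10k+20} \qquad (7) $$

where the Lagrangian coefficients L₋₁, L₀, L₁ and L₂ are tabulated below.

m | L₋₁ (−) | L₀ (+) | L₁ (+) | L₂ (−) |
0 | 0.0000 | 1.0000 | 0.0000 | 0.0000 | 10
1 | 0.0285 | 0.9405 | 0.1045 | 0.0165 | 9
2 | 0.0480 | 0.8640 | 0.2160 | 0.0320 | 8
3 | 0.0595 | 0.7735 | 0.3315 | 0.0455 | 7
4 | 0.0640 | 0.6720 | 0.4480 | 0.0560 | 6
5 | 0.0625 | 0.5625 | 0.5625 | 0.0625 | 5
  | L₂ (−) | L₁ (+) | L₀ (+) | L₋₁ (−) | m

Returning to the above example we have

$$ \chi^2_{54}(0.01) = -0.0640\,\chi^2_{40}(0.01) + 0.6720\,\chi^2_{50}(0.01) + 0.4480\,\chi^2_{60}(0.01) - 0.0560\,\chi^2_{70}(0.01) = 81.069. $$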
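The four point coefficients follow from the standard Lagrange formula for equally spaced arguments, so the worked case can be checked with a short Python sketch. The closed forms below are the usual textbook expressions with p = m/10, assumed here rather than taken from the paper; the function names are hypothetical.

```python
def lagrange_coefficients(m):
    """Four point Lagrangian coefficients at the fraction p = m/10 of the interval."""
    p = m / 10.0
    L_m1 = -p * (p - 1) * (p - 2) / 6
    L_0 = (p + 1) * (p - 1) * (p - 2) / 2
    L_1 = -(p + 1) * p * (p - 2) / 2
    L_2 = (p + 1) * p * (p - 1) / 6
    return L_m1, L_0, L_1, L_2

def interpolate_four_point(nu, table):
    """Four point interpolation using the entries at 10k-10, 10k, 10k+10, 10k+20."""
    k, m = divmod(nu, 10)
    nodes = (10 * k - 10, 10 * k, 10 * k + 10, 10 * k + 20)
    return sum(c * table[n] for c, n in zip(lagrange_coefficients(m), nodes))

# Tabulated percentage points for P = 0.01.
points_P01 = {40: 63.6907, 50: 76.1539, 60: 88.3794, 70: 100.425}

print(round(interpolate_four_point(54, points_P01), 3))
```

The four coefficients always sum to unity, which gives a quick check on any row of the coefficient table.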
t $ ~— 
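For ν > 100 we can make use of the approximation to χ² by the normal
probability integral and calculate χ²_ν(P) from the formula

$$ \chi^2_\nu(P) = \tfrac12\{y_P + \sqrt{2\nu - 1}\}^2, \qquad (8) $$

where y_P is the normal deviate corresponding to the probability level P; its values
are given in the last row of the table. This approximation becomes more
accurate as ν increases. A more accurate approximation is that of Wilson and Hilferty;
this assumes that (χ²/ν)^(1/3) is normally distributed about 1 − 2/(9ν) with standard
deviation √(2/(9ν)). That is to say, the percentage points may be calculated from

$$ \chi^2_\nu(P) = \nu\left\{1 - \frac{2}{9\nu} + y_P\sqrt{\frac{2}{9\nu}}\right\}^3. $$

Comparative numerical values showing the relative accuracy of these
formulae are given in the Note by Mrs M. Merrington, pp. 200-2 below.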
REFERENCES
(1) FISHER, R. A. (1925). Statistical Methods for Research Workers. Edinburgh: Oliver
and Boyd.
(2) PEARSON, K. Tables of the Incomplete Γ-function. London.
(3) PEARSON, K. Tables for Statisticians and Biometricians, Part I. London:
Biometrika Office.













MISCELLANEA 


(i) Theory of Probability. By HAROLD JEFFREYS. Oxford University Press. 1939.
7+380 pp. 21s. net. 


In the history of the application of probability theory to the problem of drawing inferences 
from observations, no set of ideas has played a more controversial role than that asso- 
ciated with inverse probability. Various leaders of modern thought in statistical inference 
have pointed out the logical difficulties inherent in the application of inverse probability. 
R.A. Fisher, through his methods of maximum likelihood and of fiducial limits, has intro- 
duced principles of statistical inference which make the introduction of the notion of inverse 
probability irrelevant. These principles have been extended and refined by J. Neyman, 
E. S. Pearson, A. Wald and others, until now we have available in statistical literature a
self-consistent discipline of statistical inference which is independent of inverse probability. 

In the present book the author proposes a system of statistical inference based on the 
principles of inverse probability, applying it to the same problems which have been treated 
by Fisher, Neyman, Pearson and others without using inverse probability. The attitude 
which the author takes towards probability theory is somewhat similar to that taken by 
J.M. Keynes. Probability is regarded as a subjective phenomenon. The essential idea is that 
probability is a matter of comparing ‘reasonable degrees of belief’ in propositions. In 
Chapter 1 the author goes through a considerable amount of psychological and philosophical 
discussion attempting to justify this approach. This discussion is finally formalized by a set 
of six axioms. The primitive or undefined notion is that of the relation ‘given p, q is more 
probable than r’, where p, q, and r are propositions. The symbol used for denoting the
probability of q given p is P(q | p). The six axioms which are used are as follows:

(1) Given p, q is either more or less probable than r, or both are equally probable; and no 
two of these alternatives can be true. 

(2) If p, q, r, s are four propositions and given p, q is more probable than r and r is more
probable than s, then given p, q is more probable than s.

(3) All propositions deducible from a proposition p have the same probability on data p; 
and all propositions inconsistent with p have the same probability on data p. 

(4) If, given p, q and q′ cannot both be true, and if, given p, r and r′ cannot both be true,
and if, given p, q and r are equally probable and q′ and r′ are equally probable, then given
p, ‘q or q′’ and ‘r or r′’ are equally probable.

(5) The set of possible probabilities on given data, ordered in terms of the relation ‘more 
probable’, is not of higher ordinal type than the continuum including the end-points. 

(6) If pq entails r, then P(qr | p) = P(q | p).

In axiom 6, the expression ‘a entails b’ is defined as meaning ‘a is deducible from b’, or
‘a is identical with b’, or ‘a is identical with some proposition asserted in b’. The expression
‘ab’ is taken as the logical product, that is ‘both a and b’.

The introduction of numbers for expressing probabilities is made through three ‘con- 
ventions’. The first convention associates the larger of two numbers with the more probable 
of two propositions, the second one states that if given p, q and q′ are mutually exclusive
then P(q | p) + P(q′ | p) = P(q or q′ | p), and the third states that if p entails q, then P(q | p) = 1.

In order to be thoroughly rigorous in his axiomatic approach, presumably the author 
should have postulated the existence of an aggregate of propositions on which to operate. 
The possibility of an infinite number of propositions should, of course, not be excluded, as 
will be seen when the author applies his theory to problems involving continuous random 
variables. In the case of an infinite number of propositions, it appears that the assumption
should be made that axiom 4 would hold in case of two infinite sets of mutually exclusive
alternatives. A similar assumption would have to be made for the second convention. The 
proponent of the measure theory approach to probability has essentially the same problem 
to deal with, but he handles it by assuming the existence of set functions which are completely 
additive over his postulated field of sets. 




Proceeding from his six axioms and three conventions the author completes his chapter 
by establishing twelve theorems which are used as the basis for the work in the subsequent 
chapters. The question of an infinite number of alternatives is not covered by these theorems. 

Chapter II, on ‘Direct Probabilities’, is devoted to derivations and discussions of the
binomial, normal, Poisson, Pearson, multinomial, Chi-square, t, z and other frequency laws 
and their properties. The characteristic function and illustrations of its use in the deter- 
mination of probability laws are presented. 

In building up his system of statistical inference Jeffreys proceeds in Chapters III-VIII
by applying the principle of inverse probability to the results of Chapter II. Thus if S is a
set of observations subject to a given discrete distribution law P(S | θH) derived under a
distribution hypothesis H, where θ is a parameter to be estimated, an a priori probability
function g(θ) dθ for θ is introduced. The posterior probability law of θ given S, say P(θ | SH),
is given by

$$ P(\theta \mid SH)\, d\theta = \frac{P(S \mid \theta H)\, g(\theta)\, d\theta}{\int \Sigma\, P(S \mid \theta H)\, g(\theta)\, d\theta} $$

where Σ denotes summation with respect to all possible configurations of values of S, and
the integral is taken with respect to θ. A similar analysis results when S is subject to a
continuous distribution law. The function P(θ | SH) is then taken as a basis for estimating θ.
For two values of θ, say θ₁ and θ₂, θ₁ is called more probable than θ₂ if P(θ₁ | SH) > P(θ₂ | SH).
Needless to say, this comparison of posterior probabilities depends on the choice of g(θ).
Now, from the point of view of applying this discipline, g(θ) would rarely, if ever, be known,
and the controversy over inverse probability centres around the problem of choosing g(θ).
The author adopts two rules for selecting g(θ): (1) If the parameter may have any value in
a finite range, or from −∞ to +∞, its prior probability should be taken as uniformly
distributed. (2) If the parameter may conceivably have any value from 0 to +∞, the prior
probability of its logarithm should be taken as uniformly distributed. The adoption of these
two rules appears to the reviewer to be extremely vulnerable. First of all, what does it mean,
in general, for a parameter to be uniformly distributed on the interval −∞ to +∞? This
question appears to be particularly in order since the author is performing the formal calculus
of probabilities in exactly the same manner in which measure proponents calculate
probabilities. They continually use the property that a finite total probability (taken arbitrarily
as unity) is associated with each probability function. Presumably, meaning could be
injected by carrying out the work for a finite interval −K to K and then taking the limit of
the results or answers as K → ∞. Similarly, it may be asked what it means to have the
logarithm of a parameter uniformly distributed on a semi-infinite interval. Owing to the nature
of the particular problems to which Jeffreys applies these rules, it happens that P(S | θH)
is such that formal difficulties of convergence do not arise in obtaining P(θ | SH). For the
finite interval, why should one choose the parameter to be uniformly distributed rather than
the square or some other function of the parameter, or for the semi-infinite interval (0, ∞)
why should one choose the logarithm of the parameter rather than some other function to be
uniformly distributed? It is easy to show that the assumption of uniform distribution is, in
general, inconsistent with that of uniform distribution of any single-valued function of the
parameter.
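The dependence of the posterior law on g(θ) can be seen in a small numerical sketch. Everything in it is a hypothetical illustration, not an example from the book: S is taken to be 7 successes in 10 binomial trials, and two priors on (0, 1) are compared, one uniform in θ and one uniform in θ² (which corresponds to the density 2θ for θ itself).

```python
def posterior_mean(prior, successes, trials, grid=20000):
    """Posterior mean of theta on (0, 1) by the midpoint rule.

    The posterior is proportional to prior(theta) times the binomial
    likelihood; the binomial coefficient cancels in the ratio.
    """
    num = den = 0.0
    for i in range(grid):
        theta = (i + 0.5) / grid
        weight = prior(theta) * theta**successes * (1 - theta)**(trials - successes)
        num += theta * weight
        den += weight
    return num / den

def uniform_in_theta(theta):
    return 1.0

def uniform_in_theta_squared(theta):
    # If theta**2 is uniform on (0, 1), theta itself has density 2*theta.
    return 2.0 * theta

print(posterior_mean(uniform_in_theta, 7, 10))
print(posterior_mean(uniform_in_theta_squared, 7, 10))
```

The two posterior means differ, although each prior is "uniform" in one parametrization, which is the inconsistency the reviewer has in mind.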

Chapter IV, entitled ‘Approximate Methods and Simplifications’, contains discussions,
some of them rather heuristic, of the problem of estimation involved in such topics as 
maximum likelihood, least squares, errors due to grouping, rank correlation, contingency, 
artificial randomization, etc. The inverse probability approach is, of course, maintained 
throughout. 

The problem of significance tests is treated in Chapters V and VI. Jeffreys’ attitude toward
significance tests follows along the lines of his concept of probability and consists in
comparing posterior probabilities. More specifically, suppose θ is a parameter and it is desired to
test the hypothesis that θ = 0 on the basis of a given set of observations and an hypothesis H
regarding the distribution law of S for given θ. Let q denote the hypothesis that θ = 0, and
q̄ denote the hypothesis that θ has some other value. Jeffreys’ criterion for making the
significance test is the ratio K of the two posterior probabilities P(q | SH) and P(q̄ | SH).
The value of K itself, and not the probability integral of K under the null hypothesis q, is 
proposed as the criterion. Expressions for K are found for such problems as contingency, 
comparison of means and variances in samples, consistency of two Poisson parameters, 
correlation—problems which have already been treated by Fisher, Neyman, Pearson and 
others by other approaches free from inverse probability. In the treatment of all of these 
problems, the author arrives at four principal forms of K, which he proceeds to tabulate in an 
appendix for various values of sample size and for five grades of significance corresponding to 
K = 1, 10^(−1/2), 10^(−1), 10^(−3/2), 10^(−2). The last two chapters, i.e. VII and VIII, are entitled ‘Frequency
Definitions and Direct Methods’, and ‘General Questions’, respectively. These chapters 
are primarily philosophical excursions undertaken in an attempt to show that no existing 
definition of probability avoids the notion of ‘degrees of reasonable belief’, and to justify his 
own approach as well as his attitude toward inverse probability. The discussion is almost 
entirely informal and non-mathematical and as such it must be regarded in the category of 
personal opinion. 

The book lacks strict mathematical rigour in various places, but from the point of view 
of general flow of discussion it is interestingly written. It contains many keenly chosen 
quotations and side remarks charged with a delightfully subtle humour which has cha- 
racterized the author in other books. 

From a scientific point of view it is doubtful that there will be many scholars thoroughly 
familiar with the system of statistical inference initiated by R.A. Fisher and extended by 
J. Neyman, E. S. Pearson, A. Wald and others who will abandon this system in favour of
the one proposed by Jeffreys in which inverse probability plays the central role. 
PRINCETON 
UNIVERSITY 


S. S. WILKS 


(ii) A Bibliography of Human Morphology, 1914-1939. By WILTON M. KROGMAN.
United States of America: University of Chicago Press; Great Britain and Ireland: 
Cambridge University Press. 1941. Price 18s. 


The title of this volume may mislead to some extent. The 11,000 odd references in it were
collected to aid physical anthropologists and the work will be of greater value to them than 
to other research workers, such as anatomists and geneticists, who are concerned with human 
morphology. The non-German literature is said to be covered more thoroughly than in the 
second edition of Rudolf Martin’s Lehrbuch, and there is no other comprehensive bibliography
of the subject for the period since 1928. It is not claimed that the list is exhaustive, and the 
most stringent selection appears to have been made in the section on blood groups for which 
fuller bibliographies are available. 


G.M.M. 


(iii) A Property of the Distribution of Extremes


By H. E. DANIELS 


Wool Industries Research Association 


If the chance of an observation being less than x is P, then Pⁿ is the chance that the greatest
of a random sample of n is less than x. The constants of the distribution of the greatest of a
sample in the important case when

$$ P = \int_{-\infty}^{x} \frac{e^{-\frac12 x^2}}{\sqrt{2\pi}}\, dx $$

have been calculated by Tippett (1925) for values of n up to 1000, and in a paper in which
all possible limiting forms of Pⁿ are discussed, Fisher & Tippett (1928) give limiting
formulae from which approximate values of the constants are calculated for large samples.




In the present note attention is drawn to a curious approximate relation connecting the
mean M and standard deviation σ of this distribution which holds with high accuracy for
all values of n. It was arrived at empirically and appears to have no obvious mathematical
derivation. The formula is

$$ M = 2\cot\tfrac12\pi\sigma. $$

The values of 2 cot ½πσ, calculated from Tippett’s figures, are compared in Table 1 with
Tippett’s values of M. The greatest discrepancy, δ = M − 2 cot ½πσ, over the range of n up to
1000 occurs at n = 10 where it is no more than about 1½%. For n greater than 1000 the
penultimate limiting values given in Table A of Fisher & Tippett’s paper are used in Table 2
and the discrepancies are again found to be small. The most serious occurs
when n is 7228, but their Table B suggests that the penultimate limiting values
are probably underestimated and as 2 cot ½πσ is fairly sensitive to changes in σ in the
neighbourhood of σ = 0.3, the real discrepancy may perhaps be smaller.
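The relation M = 2 cot ½πσ is easy to try out numerically. The Python sketch below is an illustration only; it uses the three pairs of M and σ that are legible in Table 1 (n = 1, 2 and 5), and the function name is hypothetical.

```python
import math

def mean_from_sigma(sigma):
    """Approximate mean of the greatest of n normal observations: 2*cot(pi*sigma/2)."""
    return 2.0 / math.tan(0.5 * math.pi * sigma)

# (n, M, sigma) as given in Table 1 for small samples.
rows = [(1, 0.0000, 1.0000), (2, 0.5642, 0.8257), (5, 1.163, 0.6690)]

for n, M, sigma in rows:
    approx = mean_from_sigma(sigma)
    print(n, M, round(approx, 4), round(M - approx, 4))
```

The last column printed is the discrepancy δ = M − 2 cot ½πσ for each row.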


Table 1

n | M | σ | 2 cot ½πσ | δ = M − 2 cot ½πσ
1 | 0.0000 | 1.0000 | |
2 | 0.5642 | 0.8257 | |
5 | 1.163 | 0.6690 | |
10 | 1.5388 | | |
20 | 1.8675 | | |
60 | 2.3193 | | | −0.0107
100 | 2.5076 | | | −0.0064
200 | 2.746 | | | −0.0012
500 | 3.0367 | | | 0.0040
1000 | 3.2414 | | | 0.0061











Table 2. Penultimate limiting values

n | M | σ | 2 cot ½πσ | δ = M − 2 cot ½πσ
7228 3-766 3-8661 0-0964 
637 x 10° 4-771 48311 0-0592 





264 x 10° 6°9262 0-1787 6-9369 0-0107 








Fisher and Tippett show that in the ultimate limiting form of the distribution the mode
m, mean M and standard deviation σ are related by the formulae

$$ M = m + \gamma c, \qquad \sigma^2 = \tfrac16\pi^2 c^2, $$

where c = m/(m² + 1) and γ = 0.577216 is Euler’s constant. Consequently

$$ M = \frac{1}{2c}\left\{1 + \sqrt{1 - 4c^2}\right\} + \gamma c \sim \frac{1}{c} = \frac{\pi}{\sigma\sqrt6} = \frac{1.28}{\sigma} $$

as σ becomes small with increasing n. On the other hand, our approximate relation gives

$$ M = 2\cot\tfrac12\pi\sigma \sim \frac{4}{\pi\sigma} = \frac{1.27}{\sigma}. $$

The error at n = ∞ is thus seen to be less than 1%.
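The closing claim can be checked with one line of arithmetic. The sketch below assumes the two limiting forms for small σ: M approaching π/(σ√6) in the ultimate Fisher and Tippett form, and 2 cot ½πσ approaching 4/(πσ). Both constants are assumptions of this sketch, stated from the standard properties of the limiting distribution and of the cotangent.

```python
import math

# Assumed limiting constants multiplying 1/sigma as sigma becomes small.
ultimate_constant = math.pi / math.sqrt(6.0)   # from sigma^2 = pi^2 c^2 / 6 and M ~ 1/c
cotangent_constant = 4.0 / math.pi             # from 2*cot(x) ~ 2/x with x = pi*sigma/2

ratio = cotangent_constant / ultimate_constant
print(round(ultimate_constant, 4), round(cotangent_constant, 4), round(1 - ratio, 4))
```

The last number printed is the relative error of the approximate relation in the limit.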


TIPPETT, L. H. C. (1925). Biometrika, 17, 364.

FISHER, R. A. & TIPPETT, L. H. C. (1928). Proc. Camb. Phil. Soc. 24, 180.













(iv) Proof of Relations connected with the Tetrachoric Series and 
its Generalization 


By M. G. KENDALL 


If a bivariate normal distribution F with variates x₁ and x₂ and correlation ρ is doubly
dichotomized at x₁ = h, x₂ = k, and

$$ d = \int_h^\infty \int_k^\infty dF, $$

it is known that

$$ d = \sum_{r=0}^{\infty} \rho^r \tau_r(h)\, \tau_r(k), \qquad (1) $$

where τ_r is the rth tetrachoric function defined by

$$ \tau_r(x) = \frac{H_{r-1}(x)\, f(x)}{\sqrt{r!}}, \qquad (2) $$

f(x) being the function $(2\pi)^{-\frac12} e^{-\frac12 x^2}$ and H_r(x) the rth Hermite polynomial defined by

$$ H_r f(x) = \left(-\frac{d}{dx}\right)^r f(x) = (-D)^r f(x). \qquad (3) $$

The purpose of this note is to present a simple proof of this result,* to prove that the
series of equation (1) is convergent for |ρ| < 1 and to generalize the series to the case of the
multivariate normal distribution.†

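The characteristic function of

$$ dF = \frac{1}{2\pi\sqrt{1 - \rho^2}} \exp\left\{-\frac{x_1^2 - 2\rho x_1 x_2 + x_2^2}{2(1 - \rho^2)}\right\} dx_1\, dx_2 $$

is, by definition,

$$ \phi(t_1, t_2) = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \exp\{i t_1 x_1 + i t_2 x_2\}\, dF, $$

and is easily seen by direct integration to be equal to

$$ \exp\{-\tfrac12(t_1^2 + 2\rho t_1 t_2 + t_2^2)\}. \qquad (4) $$

We have then, for all finite t₁, t₂,

$$ \phi(t_1, t_2) = \exp(-\tfrac12 t_1^2) \exp(-\tfrac12 t_2^2) \sum_{r=0}^{\infty} \frac{(-\rho)^r\, t_1^r t_2^r}{r!}. $$

Now

$$ d = \int_h^\infty\int_k^\infty dF = \frac{1}{(2\pi)^2} \int_h^\infty dx_1 \int_k^\infty dx_2 \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \phi(t_1, t_2) \exp\{-i t_1 x_1 - i t_2 x_2\}\, dt_1\, dt_2. $$

Substituting for φ(t₁, t₂) we have, for the coefficient of (−ρ)^r/r! in this expression,

$$ \frac{1}{(2\pi)^2} \int_h^\infty dx_1 \int_k^\infty dx_2 \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} \exp(-\tfrac12 t_1^2) \exp(-\tfrac12 t_2^2)\, t_1^r t_2^r \exp\{-i t_1 x_1 - i t_2 x_2\}\, dt_1\, dt_2, \qquad (5) $$

and this is the product of two integrals, the first of which is

$$ \frac{1}{2\pi} \int_h^\infty dx_1 \int_{-\infty}^{\infty} \exp(-\tfrac12 t_1^2)\, t_1^r \exp\{-i t_1 x_1\}\, dt_1 \qquad (6) $$

and the second of which is a similar expression in x₂ and t₂.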


* The expansion (1) appears to have been given for the first time by G. Mehler,
‘Reihenentwicklung nach Laplaceschen Functionen höherer Ordnung’, J. reine angew. Math. 66, 161.
† See Note at end of paper.


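Now, since

$$ \int_{-\infty}^{\infty} \exp(-\tfrac12 t^2)\, t^r e^{-itx}\, dt = \frac{d^r}{d(-ix)^r} \int_{-\infty}^{\infty} \exp(-\tfrac12 t^2)\, e^{-itx}\, dt = \sqrt{2\pi}\, i^r D^r e^{-\frac12 x^2} = 2\pi (-i)^r H_r(x) f(x), \qquad (7) $$

(6) is equal to

$$ (-i)^r \int_h^\infty H_r(x) f(x)\, dx = (-i)^r H_{r-1}(h) f(h), $$

and thus (5) is equal to

$$ (-1)^r H_{r-1}(h) f(h)\, H_{r-1}(k) f(k), $$

and hence

$$ d = \sum \frac{\rho^r}{r!} H_{r-1}(h) f(h)\, H_{r-1}(k) f(k) = \sum \rho^r \tau_r(h)\, \tau_r(k), \qquad (8) $$

the tetrachoric series.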
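The tetrachoric series can be summed directly as a numerical check. The Python sketch below is an illustration under stated assumptions: the Hermite polynomials are generated by the standard recurrence H_{r+1}(x) = x H_r(x) − r H_{r−1}(x), τ₀(x) is taken to be the upper tail area of the normal curve, and the series is truncated at a fixed number of terms. The function names are hypothetical.

```python
import math

def normal_density(x):
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def tetrachoric_functions(x, count):
    """Return [tau_0(x), ..., tau_{count-1}(x)] with tau_r = H_{r-1} f / sqrt(r!)."""
    f = normal_density(x)
    taus = [0.5 * math.erfc(x / math.sqrt(2.0))]   # tau_0: upper tail area
    h_prev, h_curr = 0.0, 1.0                      # h_curr holds H_{r-1}(x), starting at H_0
    for r in range(1, count):
        taus.append(h_curr * f / math.sqrt(math.factorial(r)))
        # Recurrence: H_r = x * H_{r-1} - (r - 1) * H_{r-2}
        h_prev, h_curr = h_curr, x * h_curr - (r - 1) * h_prev
    return taus

def quadrant_probability(h, k, rho, terms=60):
    """Truncated tetrachoric series for the chance that x1 > h and x2 > k."""
    th = tetrachoric_functions(h, terms)
    tk = tetrachoric_functions(k, terms)
    return sum(rho**r * a * b for r, (a, b) in enumerate(zip(th, tk)))

print(quadrant_probability(0.0, 0.0, 0.5))
```

For h = k = 0 the sum can be compared with the known closed form ¼ + arcsin ρ/(2π), which for ρ = 0.5 equals one third.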
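Now for the convergence of the series, consider

$$ |\tau_r(h)\, \tau_r(k)| = \frac{1}{r!}\, |H_{r-1}(h) f(h)\, H_{r-1}(k) f(k)|. $$

From (6)

$$ |H_{r-1}(h) f(h)| \le \frac{1}{2\pi} \left| \int_{-\infty}^{\infty} \exp(-\tfrac12 t^2)\, t^{r-1} e^{-ith}\, dt \right| \le \frac{1}{\pi} \int_0^\infty \exp(-\tfrac12 t^2)\, t^{r-1}\, dt = \frac{2^{\frac12(r-2)}}{\pi}\, \Gamma\left(\frac{r}{2}\right). $$

Hence

$$ |\tau_r(h)\, \tau_r(k)| \le \frac{2^{r-2}\, \Gamma^2(\tfrac12 r)}{\pi^2\, r!} \sim \frac{2^{r-2} e^{-(r-2)}\, 2\pi \left(\frac{r-2}{2}\right)^{r-1}}{\pi^2 \sqrt{2\pi}\, e^{-r}\, r^{r+\frac12}} \sim \frac{1}{\pi\sqrt{2\pi}\, r^{3/2}}. $$

Thus the tetrachoric series converges for |ρ| < 1, though possibly slowly near |ρ| = 1.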
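Now consider the general multivariate normal distribution

$$ dF = \frac{1}{(2\pi)^{n/2} R^{1/2}} \exp\left[-\frac{1}{2R}\left\{\Sigma R_{jj} x_j^2 + 2\Sigma R_{jk} x_j x_k\right\}\right] dx_1 \ldots dx_n, $$

where

$$ R = \begin{vmatrix} 1 & \rho_{12} & \ldots & \rho_{1n} \\ \rho_{12} & 1 & \ldots & \rho_{2n} \\ \vdots & & & \vdots \\ \rho_{1n} & \rho_{2n} & \ldots & 1 \end{vmatrix} $$

and R_jk is the minor of the jth row and kth column.
The characteristic function is

$$ \phi(t_1, \ldots, t_n) = \int_{-\infty}^{\infty} \ldots \int_{-\infty}^{\infty} \exp(\Sigma\, i t_j x_j)\, dF. $$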











To evaluate this integral make the transformation

$$ x_j = \sum_k \alpha_{jk}\, \xi_k $$

and choose the α’s so as to reduce the second degree terms in ξ in the exponent to the canonical
form Σξ². The remaining terms in t and ξ will obviously be linear in each. We may then
make a further transformation ξ′ = ξ − (linear function of the t’s) and the remaining terms,
apart from Σξ′², will be a quadratic in the t’s, and equal, say, to

$$ \Sigma \beta_{jj} t_j^2 + 2\Sigma \beta_{jk} t_j t_k. $$

The integration abolishes the terms in ξ′ and we find that the characteristic function is
proportional to

$$ \exp\{\Sigma \beta_{jj} t_j^2 + 2\Sigma \beta_{jk} t_j t_k\}. $$

Putting all but two of the t’s zero we see by comparison with (4) that the terms in t_j, t_k
are −½(t_j² + t_k² + 2ρ_jk t_j t_k) and thus the characteristic function is

$$ \exp\{-\tfrac12(\Sigma t_j^2 + 2\Sigma \rho_{jk} t_j t_k)\}. \qquad (9) $$

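The generalized tetrachoric expansion can be obtained by an expansion of (9) in terms
of the ρ’s and the application of the foregoing procedure. For instance, with three variates

$$ \exp\{-(\rho_{12} t_1 t_2 + \rho_{23} t_2 t_3 + \rho_{13} t_1 t_3)\} = \sum_{j,k,l} \frac{(-\rho_{12})^j (-\rho_{23})^k (-\rho_{13})^l}{j!\, k!\, l!}\, t_1^{j+l}\, t_2^{j+k}\, t_3^{k+l}, $$

and on integration

$$ d = \int_{h_1}^\infty \int_{h_2}^\infty \int_{h_3}^\infty dF = \sum_{j,k,l} \frac{\rho_{12}^j\, \rho_{23}^k\, \rho_{13}^l}{j!\, k!\, l!}\, H_{j+l-1}(h_1) f(h_1)\, H_{j+k-1}(h_2) f(h_2)\, H_{k+l-1}(h_3) f(h_3), $$

which will also be found to be convergent.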
[Note. Since writing this paper, under the impression that the results were new, I am
indebted to Dr A. C. Aitken for pointing out that similar results have been given in his
lectures for a number of years. Among published work, reference may be made to:

(a) P. 175 of Aitken & Turnbull’s Theory of Canonical Matrices (Blackie, 1931), where
a more direct method of deriving the characteristic function of the multivariate normal
distribution has been given.

(b) A paper on ‘Fourfold sampling with and without replacement’ by A. C. Aitken &
H. T. Gonin (1935, Proc. Roy. Soc. Edinb.), where the tetrachoric expansions are
discussed and new series associated with the binomial and the correlated
hypergeometric distributions are derived.

As the work of the Edinburgh school on this subject may not, however, be generally
familiar, the Editor has suggested that this short paper should be published, together with
the foregoing references, in order to bring the recent developments before a wider statistical
audience.]


(v) The Cumulants of the Distribution of the Square of a Variate


By J. B. S. HALDANE, F.R.S.





The following problem has arisen in several biometric investigations. The cumulants of the
distribution of x are known, and it is desired to find the cumulants of the distribution of x².
As this problem is likely to arise in future, it seems desirable to give the appropriate
transformations for the first few cumulants.

Let κ₁, κ₂, κ₃, ... be the cumulants of x.

Let μ′₁, μ′₂, μ′₃, ... be the moments of x² about zero.

Let μ₂, μ₃, μ₄, ... be the moments of x² about its mean.

Let K₁, K₂, K₃, ... be the cumulants of x².

Then μ′_r is the 2rth moment of x. These have been given in terms of the cumulants up to
the 10th, i.e. μ′₅, in the general case by Kendall (1940), and up to the 12th, i.e. μ′₆, by Haldane
(1938) when κ₁ = 0. We consider the general case first. We have such expressions as

$$ \mu'_2 = \kappa_1^4 + 6\kappa_1^2\kappa_2 + 4\kappa_1\kappa_3 + 3\kappa_2^2 + \kappa_4, $$

from which we derive the cumulants. The results are:
i 
k . + 2(2K, Ks K2 K } 
> 9 : } 
. ),-2 {2 1D) o. or9 7 —— . j 
g = SKK, Kg + JKg) + S( 0K] Ky + LAK KA 2Kq) + 2(3K,X; + 6K_kK, + 5x3) a | 
Ks }- 12K3) (1) 
9 . } 
“ee Law x ev ok.) 
32K, 18k? x, + 3¢ Ks 
' 
BK, Ky + Ko K iKgf tk | 
A t the expressions b o 4 \ K 0, ix has it o, most 
oO © % ms vanish L we 
’ A Ko 
Ka = 2 " i 
2 i 
; 
K, = | 
I i 
| 
Ks | 
j 
K; = K3X5 8 BK3K, r (2) 
K,= + 14x23, Ks +§ Bs DSK. + Sx$) | 
KeKgX_ t+ 2L0K3%_ + 189K, x; +- € | 
g 3X F 
IK x Oiie- wo ot - ; 
+ 4(L5K ky 9 + 55K gKy + 120K, Ky + LOSKgK, + 113K3) + Ky, | 










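Finally, if x be symmetrically distributed, so that all its odd cumulants vanish,

$$ \begin{aligned}
K_1 &= \kappa_2, \\
K_2 &= 2\kappa_2^2 + \kappa_4, \\
K_3 &= 8\kappa_2^3 + 12\kappa_2\kappa_4 + \kappa_6, \\
K_4 &= 48\kappa_2^4 + 144\kappa_2^2\kappa_4 + 8(3\kappa_2\kappa_6 + 4\kappa_4^2) + \kappa_8, \\
K_5 &= 384\kappa_2^5 + 1920\kappa_2^3\kappa_4 + 160\kappa_2(3\kappa_2\kappa_6 + 8\kappa_4^2) + 40(\kappa_2\kappa_8 + 5\kappa_4\kappa_6) + \kappa_{10}, \\
K_6 &= 3840\kappa_2^6 + 28800\kappa_2^4\kappa_4 + 9600\kappa_2^2(\kappa_2\kappa_6 + 4\kappa_4^2) + 240(5\kappa_2^2\kappa_8 + 50\kappa_2\kappa_4\kappa_6 + 22\kappa_4^3) \\
    &\quad + 4(15\kappa_2\kappa_{10} + 120\kappa_4\kappa_8 + 113\kappa_6^2) + \kappa_{12}.
\end{aligned} \qquad (3) $$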





I have bracketed together terms which are products of the same number of κ_r’s. If x is
a linear function of observed numbers in a sample of n, every κ_r is proportional to n, so the
terms in brackets will all be multiples of the same power of n.
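The symmetric formulae (3) can be checked on two cases whose answers are known independently. The Python sketch below is an illustration: the function name is hypothetical, and the two test distributions are choices made here. For a unit normal variate x² follows χ² with one degree of freedom, whose rth cumulant is 2^(r−1)(r−1)!. For a variate taking the values +1 and −1 with equal chance, x² is constant, so every cumulant after the first must vanish.

```python
def square_cumulants_symmetric(k):
    """Cumulants K1..K6 of x**2 from the even cumulants k[2], ..., k[12] of a symmetric x."""
    k2, k4, k6, k8, k10, k12 = (k[r] for r in (2, 4, 6, 8, 10, 12))
    K1 = k2
    K2 = 2 * k2**2 + k4
    K3 = 8 * k2**3 + 12 * k2 * k4 + k6
    K4 = 48 * k2**4 + 144 * k2**2 * k4 + 8 * (3 * k2 * k6 + 4 * k4**2) + k8
    K5 = (384 * k2**5 + 1920 * k2**3 * k4 + 160 * k2 * (3 * k2 * k6 + 8 * k4**2)
          + 40 * (k2 * k8 + 5 * k4 * k6) + k10)
    K6 = (3840 * k2**6 + 28800 * k2**4 * k4 + 9600 * k2**2 * (k2 * k6 + 4 * k4**2)
          + 240 * (5 * k2**2 * k8 + 50 * k2 * k4 * k6 + 22 * k4**3)
          + 4 * (15 * k2 * k10 + 120 * k4 * k8 + 113 * k6**2) + k12)
    return [K1, K2, K3, K4, K5, K6]

# Unit normal: only the second cumulant is non-zero.
normal = {2: 1, 4: 0, 6: 0, 8: 0, 10: 0, 12: 0}
# Values +1 and -1 with equal chance: cumulants from the expansion of log cosh t.
plus_minus_one = {2: 1, 4: -2, 6: 16, 8: -272, 10: 7936, 12: -353792}

print(square_cumulants_symmetric(normal))
print(square_cumulants_symmetric(plus_minus_one))
```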


REFERENCES 


HALDANE, J. B. S. (1938). ‘The first six moments of χ² for an n-fold table with n degrees
of freedom when some expectations are small.’ Biometrika, 29, 389-91.

KENDALL, M. G. (1940). ‘The derivation of multivariate sampling formulae from
univariate formulae by symbolic operation.’ Ann. Eugen., Lond., 10, 392-402.


(vi) Numerical approximations to the percentage points of
the χ² distribution


By MAXINE MERRINGTON 


The use of two approximate formulae has been suggested for calculating a percentage
point, $\chi^2_\nu(P)$, of the $\chi^2$ distribution, corresponding to $\nu$ degrees of freedom and a probability
level $P$.* Both formulae involve the use of the standardized normal deviate, $y_P$, corresponding
to the value of $P$ chosen.


(1) R. A. Fisher’s (1925) formula:

$$\chi^2_\nu(P) = \tfrac{1}{2}\{y_P + \sqrt{2\nu - 1}\}^2, \tag{a}$$

which assumes that $\sqrt{2\chi^2}$ is normally distributed about $\sqrt{2\nu - 1}$ with unit standard
deviation.


(2) E. B. Wilson & M. M. Hilferty’s (1931) formula:

$$\chi^2_\nu(P) = \nu\left[1 - \frac{2}{9\nu} + y_P\sqrt{\frac{2}{9\nu}}\right]^3, \tag{b}$$

which assumes that $(\chi^2/\nu)^{1/3}$ is normally distributed about $1 - 2/(9\nu)$ with a standard
deviation of $\sqrt{2/(9\nu)}$.
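
As a minimal sketch, the two formulae can be evaluated as follows. It is assumed here that $P$ is the probability of exceeding the percentage point, so that $y_P$ is the normal deviate exceeded with probability $P$ (negative for $P > 0.5$); the function names are illustrative and not from the original note.

```python
from math import sqrt
from statistics import NormalDist


def normal_deviate(p):
    """Standardized normal deviate exceeded with probability p (assumed convention)."""
    return NormalDist().inv_cdf(1.0 - p)


def fisher(nu, p):
    """Formula (a): half the square of y_P + sqrt(2*nu - 1)."""
    return 0.5 * (normal_deviate(p) + sqrt(2 * nu - 1)) ** 2


def wilson_hilferty(nu, p):
    """Formula (b): nu times the cube of 1 - 2/(9*nu) + y_P * sqrt(2/(9*nu))."""
    c = 2.0 / (9.0 * nu)
    return nu * (1.0 - c + normal_deviate(p) * sqrt(c)) ** 3


# Both approximations for 30 degrees of freedom at three probability levels.
for p in (0.995, 0.5, 0.005):
    print(p, round(fisher(30, p), 3), round(wilson_hilferty(30, p), 3))
```

At $P = 0.5$ the deviate is zero, so (a) reduces to $\nu - \tfrac{1}{2}$ and (b) to $\nu\{1 - 2/(9\nu)\}^3$, which gives a quick check on either implementation.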

Professor Pearson has suggested that I should prepare a comparative table showing
for certain $\nu$ and $P$ the numerical values of $\chi^2_\nu(P)$:

(a) calculated from Fisher’s formula,

(b) calculated from Wilson & Hilferty’s formula,

(c) the correct values taken from Miss Thompson’s table (pp. 188-9 above).


* The notation used is that adopted in the paper on pp. 187-91 above. 


Comparative Table of Percentage Points

[The table was printed sideways and its entries are not legible in this copy. For $\nu$ = 30, 40, 50, 75, 100, 150 and 200, and for $P$ = 0.995, 0.990, 0.950, 0.900, 0.750, 0.500, 0.250, 0.100, 0.050, 0.010 and 0.005, it gives $\chi^2_\nu(P)$ (a) from R. A. Fisher’s formula, (b) from E. B. Wilson & M. M. Hilferty’s formula, and (c) from Thompson’s table, together with the normal deviate $y_P$ for each $P$.]













The comparison is shown in the preceding table.* It will be seen that: 

(1) For the range of values of $\nu$ and $P$ considered, the formula (b) is consistently more
accurate than (a); the greatest absolute value of the error is about 0.04 for (b) while at the
tails of the distribution it amounts to over 1.00 for (a).

(2) Formula (b) is extremely accurate in the neighbourhood of the two 5% points
($P$ = 0.950 and 0.050).

(3) For neither formula do the actual error values decrease very much as $\nu$ increases
but, since $\chi^2$ increases with $\nu$, the relative error values do, of course, decrease.
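
The three observations can be illustrated with a short calculation. This is a sketch under stated assumptions: the exact percentage points below are quoted to three decimals from standard $\chi^2$ tables for a few cases only, $P$ is taken as the probability of exceeding the percentage point, and the names `EXACT`, `fisher` and `wilson_hilferty` are introduced purely for this example.

```python
from math import sqrt
from statistics import NormalDist

# Assumed exact values of the percentage points, keyed by (nu, P),
# quoted to three decimals from standard chi-square tables.
EXACT = {(30, 0.5): 29.336, (100, 0.5): 99.334, (100, 0.005): 140.169}


def normal_deviate(p):
    """Normal deviate exceeded with probability p (assumed convention)."""
    return NormalDist().inv_cdf(1.0 - p)


def fisher(nu, p):
    """Formula (a)."""
    return 0.5 * (normal_deviate(p) + sqrt(2 * nu - 1)) ** 2


def wilson_hilferty(nu, p):
    """Formula (b)."""
    c = 2.0 / (9.0 * nu)
    return nu * (1.0 - c + normal_deviate(p) * sqrt(c)) ** 3


# Absolute and relative error of each formula against the assumed exact value.
for (nu, p), exact in EXACT.items():
    for label, formula in (("a", fisher), ("b", wilson_hilferty)):
        err = formula(nu, p) - exact
        print(f"nu={nu:3d} P={p:5.3f} ({label}) "
              f"error={err:+.3f} relative={err / exact:+.5f}")
```

For formula (a) at $P = 0.5$ the absolute error is nearly the same at $\nu = 30$ and $\nu = 100$, while the relative error falls roughly in proportion to $1/\nu$; at $\nu = 100$, $P = 0.005$ the error of (a) exceeds 1 while that of (b) stays within a few hundredths.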


REFERENCES 
FISHER, R. A. (1925). Statistical Methods for Research Workers. Edinburgh: Oliver and
Boyd.
GARWOOD, F. (1936). Biometrika, 28, 441.
WILSON, E. B. & HILFERTY, M. M. (1931). Proc. Nat. Acad. Sci., Wash., 17, 684.
* An earlier comparison of the approximations has been made by F. Garwood (1936).


(All Rights reserved) 
BIOMETRIKA. Vol. XXXII, Part II 
CONTENTS 


PAGE 
The laws of chance, in relation to thought and conduct. An introductory lecture delivered by
KARL PEARSON in 1892 . . . 89-100

Medical statistics from Graunt to Farr. Part I. By MAJOR GREENWOOD . . . 101-127

Fiducial argument and the theory of confidence intervals. By J. NEYMAN . . . 128-150

Tables of percentage points of the Incomplete Beta Function. Computed by CATHERINE M.
THOMPSON

Prefatory Note. By E. S. PEARSON . . . 151-153

Description of the calculation and methods of interpolation. By L. J. COMRIE and H. O.
HARTLEY

Tables

Table of Lagrangian coefficients for harmonic interpolation in certain tables of percentage points.
Prepared by L. J. COMRIE and H. O. HARTLEY . . . 183-186

Table of percentage points of the $\chi^2$ distribution. Calculated by CATHERINE M. THOMPSON . . . 187-191

MISCELLANEA:

(i) Review of HAROLD JEFFREYS’ Theory of Probability. By S. S. WILKS . . . 192-194
(ii) Review of WILTON M. KROGMAN’s A Bibliography of Human Morphology, 1914-1939 . . . 194
(iii) A property of the distribution of extremes. By H. E. DANIELS . . . 194-195
(iv) Proof of relations connected with the Tetrachoric Series and its generalization. By M. G.
KENDALL . . . 196-198
(v) The cumulants of the distribution of the square of a variate. By J. B. S. HALDANE . . . 199-200
(vi) Numerical approximations to the percentage points of the $\chi^2$ distribution. By MAXINE
MERRINGTON . . . 200-202


A volume of Biometrika containing about 400 pages, with plates and tables, is normally issued annually. Owing to war 
conditions, however, delay in issue is inevitable. 

Papers for publication should either be sent to 

PROFESSOR E. S. PEARSON, Department of Statistics, University College, London, W.C.1,
or if more convenient may be submitted through a member of the Editorial Committee, viz.
PROFESSOR HARALD CRAMÉR, University of Stockholm, Sweden.
DR R. C. GEARY, Statistics Branch, Department of Industry and Commerce, Dublin.
PROFESSOR M. GREENWOOD, F.R.S., London School of Hygiene and Tropical Medicine, London, W.C.1.
PROFESSOR J. B. S. HALDANE, F.R.S., University College, London, W.C.1.
DR G. M. MORANT, University College, London, W.C.1.
DR JOHN WISHART, School of Agriculture, Cambridge.

It is a condition of publication in Biometrika that the paper shall not already have been issued elsewhere, and will not be 
reprinted without leave of the Editors. 

Contributors receive 25 copies of their papers free. Joint authors 15 copies each. The price of additional copies which
should be ordered when the author’s proof is returned will depend upon the number of pages, plates, etc. 

The subscription price, payable in advance, is Inland 45s. net per volume and Abroad 54s. net (including packing and 
postage). Owing to the scarcity of early volumes, the following rates must now be charged for complete sets. Vols. I—XXXI, 
including XX*®: £128 in buckram, £117 in wrappers, not including postage. Recent volumes may still be obtained at 
the wrapper price; this is 64s. inland, including postage. Standard buckram cases with Darwin block, price 3s. 6d. + the 
postage per volume. Index to Vols. I to V, 2s. net. Index to Vols. I to XV, 5s. net. Cheques must be made payable 
to Biometrika and sent to The Secretary, Biometrika Office, Department of Statistics, University College, London, W.C.1, 
to whom all orders for series, single copies and offprints should be addressed. All cheques must be properly stamped and
should be crossed “a/c Biometrika Trust”. No foreign cheques can be accepted unless they are drawn in sterling and
payable at a London agency. 


First Printed in Great Britain at the University Press, Cambridge 
Reprinted by offset-litho by Percy Lund Humphries & Co., Ltd., Bradford








